Jaideep Ray

2 days ago

Check your segment metrics!

ML models are increasingly used in applications that impact a large and diverse population. The models are evaluated based on their primary metric's performance over the whole population. For example, a recommendation model's evaluation is based on an offline metric such as normalized entropy or NDCG computed on human judgements, and online…
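The excerpt cuts off here, but its core motivation — that an aggregate metric can hide a segment on which the model underperforms — can be sketched in a few lines of Python. The segment names and data below are invented for illustration:

```python
import math
from collections import defaultdict

def log_loss(labels, preds):
    """Mean binary cross-entropy over paired labels and predictions."""
    eps = 1e-15
    total = 0.0
    for y, p in zip(labels, preds):
        p = min(max(p, eps), 1 - eps)  # clip to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1 - p))
    return total / len(labels)

def segment_metrics(rows):
    """rows: (segment, label, prediction) triples -> per-segment log loss."""
    by_segment = defaultdict(lambda: ([], []))
    for seg, y, p in rows:
        by_segment[seg][0].append(y)
        by_segment[seg][1].append(p)
    return {seg: log_loss(ys, ps) for seg, (ys, ps) in by_segment.items()}

rows = [
    ("segment_a", 1, 0.9), ("segment_a", 0, 0.2),
    ("segment_b", 1, 0.4), ("segment_b", 0, 0.7),  # model is weaker here
]
overall = log_loss([r[1] for r in rows], [r[2] for r in rows])
per_seg = segment_metrics(rows)
```

Here the overall log loss averages away the problem: segment_b's loss is several times segment_a's, which is exactly what a whole-population evaluation would miss.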

Mlops

2 min read


Published in Log-Loss

·Jun 6

Evaluation for perception systems

Object detection — In recent years, the ML community has made huge progress in object detection through models such as Faster R-CNN, Mask R-CNN, and YOLO, among others. Let's consider the system outputs as predicted classes and bounding boxes. Intersection over Union (IoU): IoU evaluates the overlap between the ground-truth bounding box (gt) and…
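As a quick sketch of the metric the excerpt introduces, IoU for axis-aligned boxes is intersection area over union area. The box coordinates below are invented for illustration:

```python
def iou(box_a, box_b):
    """Intersection over Union for axis-aligned boxes (x1, y1, x2, y2)."""
    # Coordinates of the intersection rectangle.
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

gt = (0, 0, 10, 10)
pred = (5, 0, 15, 10)
# Intersection is a 5x10 strip (area 50); union is 100 + 100 - 50 = 150.
overlap = iou(gt, pred)  # 50/150 = 1/3
```

A detection is typically counted as a true positive when its IoU with a ground-truth box exceeds a threshold such as 0.5.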

Machine Learning

3 min read


Published in Better ML

·Jun 2

The benefits of multi-tasking!

What is multi-task learning? — In machine learning, the general objective is to learn one model for a task, given the dataset corresponding to that task. This can be seen as single-task learning. Learning a single model jointly for multiple tasks is termed multi-task learning (MTL). For example…
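A minimal sketch of the MTL idea, assuming the common shared-trunk design with one small head per task. The task names ("click", "like") and all shapes are invented for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

# Shared trunk: one hidden layer whose weights receive gradients from ALL tasks.
W_shared = rng.normal(size=(8, 16))

# One small head per task on top of the shared representation.
heads = {
    "click": rng.normal(size=(16, 1)),  # hypothetical task names
    "like": rng.normal(size=(16, 1)),
}

def forward(x):
    h = np.maximum(0, x @ W_shared)  # shared ReLU representation
    return {task: h @ W for task, W in heads.items()}

x = rng.normal(size=(4, 8))  # batch of 4 examples
outputs = forward(x)

# Joint MTL objective: sum of per-task losses. During training, gradients
# from every task flow into W_shared — that sharing is what makes it MTL.
targets = {t: rng.normal(size=(4, 1)) for t in heads}
loss = sum(np.mean((outputs[t] - targets[t]) ** 2) for t in heads)
```

Single-task learning would train a separate trunk per task; here the trunk is shared and only the heads are task-specific.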

Ml Platform

3 min read


Published in Better ML

·May 26

Red AI => Green AI

What is it? — We are still in the early days of AI. The general objective is to find the architecture that gives us the highest accuracy for a given task, essentially “buying” stronger results by paying a high computational cost. The authors of the Green AI paper call this trend Red AI. …

Mlops

2 min read


Published in Better ML

·May 4

Flavors of model evaluation

In this article we describe the various flavors of model evaluation in a typical large-scale model development lifecycle. We focus on the following stages: offline training, model processing, recurring training (post-deployment), and continuous evaluation (post-deployment). 1. Offline training: Offline training involves training from scratch. Developers experiment with various modeling paradigms (architecture…

Mlops Without Much Ops

3 min read


Published in Log-Loss

·Apr 30

Pre-trained representations

The best thing since sliced bread came around! Pre-trained word embeddings are an integral part of modern NLP systems, offering significant improvements over embeddings learned from scratch. There are two existing strategies for applying pre-trained language representations to downstream tasks: feature-based and fine-tuning. Feature-based approach: throw in pre-trained representations as additional…

Machine Learning

2 min read


Published in Log-Loss

·Apr 29

Distributed Sync SGD

Sync SGD has emerged as the most popular training paradigm for large-scale training. Let’s see why. Let’s study the algorithm for distributed SGD with all-reduce: Distributed: the parameters of the model architecture can be distributed across several machines C1, C2, . . . …
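A toy simulation of one synchronous step, with plain Python lists standing in for the workers and a mean all-reduce. The quadratic loss and the data shards are invented for illustration:

```python
def all_reduce_mean(grads):
    """All-reduce with a mean op: every worker receives the average gradient."""
    n = len(grads)
    avg = [sum(g[i] for g in grads) / n for i in range(len(grads[0]))]
    return [avg[:] for _ in grads]  # identical copy delivered to each worker

def sync_sgd_step(params_per_worker, shards, lr=0.1):
    # 1. Each worker computes a gradient on its own data shard.
    #    Toy loss per example: 0.5 * (p - x)^2, so grad = mean(p - x).
    grads = [
        [sum(p - x for x in shard) / len(shard) for p in params]
        for params, shard in zip(params_per_worker, shards)
    ]
    # 2. All-reduce: average the gradients across workers.
    reduced = all_reduce_mean(grads)
    # 3. Every worker applies the identical update, so replicas stay in sync.
    return [
        [p - lr * g for p, g in zip(params, gs)]
        for params, gs in zip(params_per_worker, reduced)
    ]

workers = [[0.0], [0.0], [0.0]]           # same initial parameter on 3 workers
shards = [[1.0, 2.0], [3.0], [4.0, 6.0]]  # training data split across workers
workers = sync_sgd_step(workers, shards)
```

Because every worker applies the same averaged gradient, the replicas remain bit-for-bit identical after each step — the defining property of synchronous SGD.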

Machine Learning

3 min read


Published in Better ML

·Apr 25

Wide and Deep model training

Wide and deep model size — There are two aspects of learning: memorization and generalization. Memorization comes from learning frequently co-occurring features and exploiting the correlations available in the training data. Generalization comes from learning hidden patterns and correlations between features and labels. Wide & Deep learning jointly trains wide linear models and deep…
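A minimal sketch of the joint wide-and-deep logit, assuming a linear wide part over sparse cross features (memorization) and a one-hidden-layer deep part (generalization). All shapes and inputs below are invented for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

# Wide part: sparse cross-product features -> linear weights (memorization).
w_wide = rng.normal(size=5)

# Deep part: dense features -> small ReLU MLP (generalization).
W1 = rng.normal(size=(3, 4))
w2 = rng.normal(size=4)

def wide_and_deep_logit(x_wide, x_deep):
    wide = x_wide @ w_wide                  # linear memorization term
    deep = np.maximum(0, x_deep @ W1) @ w2  # non-linear generalization term
    return wide + deep                      # summed into one logit

x_wide = np.array([1.0, 0.0, 1.0, 0.0, 0.0])  # e.g. one-hot feature crosses
x_deep = np.array([0.2, -0.5, 0.7])
p = 1 / (1 + np.exp(-wide_and_deep_logit(x_wide, x_deep)))  # sigmoid
```

The two parts are trained jointly: the loss is computed on the combined logit, so gradients update the wide weights and the deep network together.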

Mlops

4 min read


Published in Better ML

·Apr 23

DNN computation mental model

Why does it matter? — Perf matters! A mental model of deep neural network computation should help you do back-of-the-envelope calculations for your components. A typical training iteration contains three steps: a forward pass to compute the loss, a backward pass to compute gradients, and an optimizer step to update the parameters. Forward pass refers to…
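The three steps can be written out explicitly for a toy one-parameter model y_hat = w * x with squared loss; the learning rate and data below are illustrative:

```python
def train_step(w, x, y, lr=0.1):
    # 1. Forward pass: run the model and compute the loss.
    y_hat = w * x
    loss = 0.5 * (y_hat - y) ** 2
    # 2. Backward pass: dloss/dw via the chain rule.
    grad_w = (y_hat - y) * x
    # 3. Optimizer step: plain SGD update of the parameter.
    w = w - lr * grad_w
    return w, loss

w = 0.0
for _ in range(50):
    w, loss = train_step(w, x=2.0, y=4.0)
# w converges to 2.0, the value that makes w * 2.0 == 4.0
```

In a real DNN the same three steps apply per layer, which is why forward and backward FLOPs (roughly 1:2) dominate back-of-the-envelope cost estimates.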

Mlops

2 min read


Published in Better ML

·Feb 20

Model size

Why does it matter? Model size is a key metric driving optimizations such as: Training system design: choosing a specific SGD optimizer, network bandwidth, and hardware (memory and processing). Inference system design: model file storage optimizations, quantization, and model splits for serving. How do we measure it? While describing model size (in memory units such as GB), we…
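A back-of-the-envelope sketch of the measurement, assuming size ≈ parameter count × bytes per parameter; the layer breakdown below is hypothetical:

```python
def model_size_gb(param_counts, bytes_per_param=4):
    """Back-of-the-envelope model size: parameter count x precision.

    bytes_per_param: 4 for fp32, 2 for fp16/bf16, 1 for int8 quantization.
    """
    total_params = sum(param_counts.values())
    return total_params * bytes_per_param / 1024 ** 3

# Hypothetical layer breakdown, for illustration only.
layers = {"embedding": 500_000_000, "mlp": 100_000_000}
fp32 = model_size_gb(layers, bytes_per_param=4)  # training precision
int8 = model_size_gb(layers, bytes_per_param=1)  # quantized for serving
```

The same arithmetic shows why quantizing from fp32 to int8 cuts the serving footprint to a quarter, while the training footprint is larger still once optimizer state and gradients are counted.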

Mlops Without Much Ops

2 min read


Jaideep Ray

Engineer | ML lifecycle | ML Evaluation | https://www.linkedin.com/in/jaideepray/

