Jaideep Ray – Medium

Jaideep Ray
in
Better ML

Training Small Language Models on a Budget

List of training optimizations!

2 min readApr 20, 2024

--

Training Small Language Models on a Budget

--

Jaideep Ray
in
Better ML

Will it scale to N GPUs ?

Multi-node training & cluster network bandwidth

5 min readApr 1, 2024

--

Will it scale to N GPUs ?

--

Jaideep Ray
in
Better ML

Perf model cards

Model cards are metadata for trained ML models that provide benchmarked evaluation and performance characteristics. It is an effective…

2 min readFeb 12, 2024

--

1

Perf model cards

--

1

Jaideep Ray
in
Better ML

NV Triton inference server’s concurrent execution

2 min readFeb 5, 2024

--

NV Triton inference server’s concurrent execution

--

Jaideep Ray

Thank you. Sure, I will take an attempt at both of them.

1 min readFeb 5, 2024

--

--

Jaideep Ray
in
Better ML

LLM serving challenges

Discussing the unique serving challenges of LLMs

5 min readJan 27, 2024

--

1

LLM serving challenges

--

1

Jaideep Ray
in
Better ML

Portability of AI stack

Context:

4 min readJan 22, 2024

--

Portability of AI stack

--

Jaideep Ray
in
Better ML

Checkpointing for distributed training failures

Challenges in checkpointing

3 min readDec 16, 2023

--

Checkpointing for distributed training failures

--

Jaideep Ray
in
Better ML

Model quality & compute budget

A scaling law is a mathematical formula that describes how properties of a system change with the size/scale of a system. Recent work in…

2 min readOct 16, 2023

--

Model quality & compute budget

--

Jaideep Ray
in
Better ML

Offline Batch Inference for large models

Batching up for profit!

3 min readAug 13, 2023

--

1

Offline Batch Inference for large models

--

1

Jaideep Ray

Jaideep Ray

Engineer | ML Platforms | Model lifecycle| https://www.linkedin.com/in/jaideepray/

Following

Help
Status
About
Careers
Blog
Privacy
Terms
Text to speech
Teams