Jaideep RayinBetter MLTraining Small Language Models on a BudgetList of training optimizations!2 min read·Apr 20, 2024----
Jaideep RayinBetter MLWill it scale to N GPUs ?Multi-node training & cluster network bandwidth5 min read·Apr 1, 2024----
Jaideep RayinBetter MLPerf model cardsModel cards are metadata for trained ML models that provide benchmarked evaluation and performance characteristics. It is an effective…2 min read·Feb 12, 2024--1--1
Jaideep RayinBetter MLLLM serving challengesDiscussing the unique serving challenges of LLMs5 min read·Jan 27, 2024--1--1
Jaideep RayinBetter MLCheckpointing for distributed training failuresChallenges in checkpointing3 min read·Dec 16, 2023----
Jaideep RayinBetter MLModel quality & compute budgetA scaling law is a mathematical formula that describes how properties of a system change with the size/scale of a system. Recent work in…2 min read·Oct 16, 2023----
Jaideep RayinBetter MLOffline Batch Inference for large modelsBatching up for profit!3 min read·Aug 13, 2023--1--1