Jaideep Ray in Better ML · Training Small Language Models on a Budget · List of training optimizations! · Apr 20
Jaideep Ray in Better ML · Will it scale to N GPUs? · Multi-node training & cluster network bandwidth · Apr 1
Jaideep Ray in Better ML · Perf model cards · Model cards are metadata for trained ML models that provide benchmarked evaluation and performance characteristics. It is an effective… · Feb 12
Jaideep Ray in Better ML · LLM serving challenges · Discussing the unique serving challenges of LLMs · Jan 27
Jaideep Ray in Better ML · Checkpointing for distributed training failures · Challenges in checkpointing · Dec 16, 2023
Jaideep Ray in Better ML · Model quality & compute budget · A scaling law is a mathematical formula that describes how properties of a system change with the size/scale of a system. Recent work in… · Oct 16, 2023
Jaideep Ray in Better ML · Offline Batch Inference for large models · Batching up for profit! · Aug 13, 2023