Published in Better ML · Training Small Language Models on a Budget · List of training optimizations! · Apr 20
Published in Better ML · Will it scale to N GPUs? · Multi-node training & cluster network bandwidth · Apr 1
Published in Better ML · Perf model cards · Model cards are metadata for trained ML models that provide benchmarked evaluation and performance characteristics. It is an effective… · Feb 12
Published in Better ML · LLM serving challenges · Discussing the unique serving challenges of LLMs · Jan 27
Published in Better ML · Checkpointing for distributed training failures · Challenges in checkpointing · Dec 16, 2023
Published in Better ML · Model quality & compute budget · A scaling law is a mathematical formula that describes how properties of a system change with the size/scale of a system. Recent work in… · Oct 16, 2023
Published in Better ML · Offline Batch Inference for large models · Batching up for profit! · Aug 13, 2023