Home / Transit / Megatron-DeepSpeed Megatron-DeepSpeed Combining Megatron-LM and DeepSpeed for efficient large-scale training Package 1.4k stars GitHub Back to Transit