Skip to content
M

Megatron-DeepSpeed

Project ID: 952

Ongoing research training transformer language models at scale, including: BERT & GPT-2