ML Wiki

Tag: tensor-parallel

1 item with this tag.

  • May 09, 2026

    Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism

    • source
    • distributed-training
    • model-parallel
    • tensor-parallel
    • pre-training
    • systems