ML Wiki

Tag: moe

2 items with this tag.

  • Apr 16, 2026

    Switch Transformers: Scaling to Trillion Parameter Models with Sparse MoE

    • source
    • mixture-of-experts
    • moe
    • scaling
    • efficiency
    • sparse
  • Apr 10, 2026

    Mixtral of Experts

    • source
    • mixtral
    • mixture-of-experts
    • moe
    • sparse-moe
    • inference-efficiency
    • open-weights