ML Wiki

Tag: efficiency

8 items with this tag.

Apr 27, 2026
Long Context
Apr 24, 2026
Dynamic Computation
Apr 22, 2026
Model Compression
Apr 17, 2026
Making LLMs Fast — The Inference Efficiency Stack
Apr 16, 2026
Mixture of Experts (MoE)
Apr 16, 2026
Switch Transformers: Scaling to Trillion Parameter Models with Sparse MoE
Apr 10, 2026
LLaMA: Open and Efficient Foundation Language Models
Apr 04, 2026
Distillation (Knowledge Distillation)