ML Wiki

Tag: scaling-laws

6 items with this tag.

  • May 09, 2026

    Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

    • source
    • small-language-models
    • data-quality
    • synthetic-data
    • scaling-laws
    • edge
  • May 04, 2026

    Training Compute-Optimal Large Language Models (Chinchilla)

    • source
    • scaling
    • compute-optimal-training
    • scaling-laws
    • pre-training
    • chinchilla
  • Apr 18, 2026

    Evaluating Large Language Models Trained on Code (Codex)

    • source
    • code-generation
    • pre-training
    • fine-tuning
    • sampling
    • scaling-laws
  • Apr 18, 2026

    Scalable Diffusion Models with Transformers

    • source
    • diffusion
    • vision-transformer
    • scaling-laws
    • latent-space
    • architecture
  • Apr 17, 2026

    Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer (T5)

    • source
    • transfer-learning
    • pre-training
    • encoder-decoder
    • fine-tuning
    • scaling-laws
    • nlp
  • Apr 10, 2026

    Scaling Laws for Neural Language Models

    • source
    • scaling
    • compute
    • scaling-laws
    • pretraining
    • language-models