ML Wiki

Tag: training

11 items with this tag.

  • Apr 11, 2026

    Transfer Learning

    • concept
    • training
    • fine-tuning
  • Apr 10, 2026

    How LLMs Are Trained — From Scratch to RLHF

    • learning-path
    • training
    • llm
  • Apr 10, 2026

    Alignment (AI)

    • concept
    • alignment
    • training
  • Apr 10, 2026

    Contrastive Learning

    • concept
    • training
    • self-supervised
  • Apr 10, 2026

    PPO (Proximal Policy Optimization)

    • concept
    • reinforcement-learning
    • training
    • alignment
  • Apr 10, 2026

    Reward Model

    • concept
    • alignment
    • training
    • rlhf
  • Apr 10, 2026

    Scaling Laws

    • concept
    • training
    • scaling
  • Apr 04, 2026

    Distillation (Knowledge Distillation)

    • concept
    • training
    • efficiency
  • Apr 04, 2026

    DPO (Direct Preference Optimization)

    • concept
    • alignment
    • training
  • Apr 04, 2026

    RLHF (Reinforcement Learning from Human Feedback)

    • concept
    • alignment
    • training
  • Apr 04, 2026

    SFT (Supervised Fine-Tuning)

    • concept
    • training
    • alignment