ML Wiki
Tag: training
11 items with this tag.
Apr 11, 2026
Transfer Learning
concept
training
fine-tuning
Apr 10, 2026
How LLMs Are Trained — From Scratch to RLHF
learning-path
training
llm
Apr 10, 2026
Alignment (AI)
concept
alignment
training
Apr 10, 2026
Contrastive Learning
concept
training
self-supervised
Apr 10, 2026
PPO (Proximal Policy Optimization)
concept
reinforcement-learning
training
alignment
Apr 10, 2026
Reward Model
concept
alignment
training
rlhf
Apr 10, 2026
Scaling Laws
concept
training
scaling
Apr 04, 2026
Distillation (Knowledge Distillation)
concept
training
efficiency
Apr 04, 2026
DPO (Direct Preference Optimization)
concept
alignment
training
Apr 04, 2026
RLHF (Reinforcement Learning from Human Feedback)
concept
alignment
training
Apr 04, 2026
SFT (Supervised Fine-Tuning)
concept
training
alignment