ML Wiki
Search
Search
Explorer
Tag: llm
6 items with this tag.
Apr 21, 2026
GPT-4 Technical Report
source
llm
scaling
alignment
multimodal
rlhf
Apr 18, 2026
Code Generation
concept
code
llm
fine-tuning
Apr 18, 2026
Mistral 7B
source
architecture
llm
inference-efficiency
attention
Apr 17, 2026
How LLMs Are Trained — From Scratch to RLHF
learning-path
training
llm
Apr 17, 2026
Llama 2: Open Foundation and Fine-Tuned Chat Models
source
pre-training
rlhf
sft
gqa
alignment
llm
meta-ai
Apr 10, 2026
Training language models to follow instructions with human feedback (InstructGPT)
source
alignment
rlhf
llm
fine-tuning
safety