ML Wiki

Tag: llm

2 items with this tag.

  • Apr 10, 2026

    How LLMs Are Trained — From Scratch to RLHF

    • learning-path
    • training
    • llm
  • Apr 10, 2026

    Training language models to follow instructions with human feedback (InstructGPT)

    • source
    • alignment
    • rlhf
    • llm
    • fine-tuning
    • safety