ML Wiki

Tag: llm

6 items with this tag.

  • Apr 21, 2026

    GPT-4 Technical Report

    • source
    • llm
    • scaling
    • alignment
    • multimodal
    • rlhf
  • Apr 18, 2026

    Code Generation

    • concept
    • code
    • llm
    • fine-tuning
  • Apr 18, 2026

    Mistral 7B

    • source
    • architecture
    • llm
    • inference-efficiency
    • attention
  • Apr 17, 2026

    How LLMs Are Trained — From Scratch to RLHF

    • learning-path
    • training
    • llm
  • Apr 17, 2026

    Llama 2: Open Foundation and Fine-Tuned Chat Models

    • source
    • pre-training
    • rlhf
    • sft
    • gqa
    • alignment
    • llm
    • meta-ai
  • Apr 10, 2026

    Training language models to follow instructions with human feedback (InstructGPT)

    • source
    • alignment
    • rlhf
    • llm
    • fine-tuning
    • safety