ML Wiki

Tag: fine-tuning

12 items with this tag.

  • Apr 25, 2026

    Learning to Summarize from Human Feedback

    • source
    • rlhf
    • alignment
    • reward-model
    • ppo
    • sft
    • fine-tuning
  • Apr 20, 2026

    BART: Denoising Sequence-to-Sequence Pre-training

    • source
    • encoder-decoder
    • pre-training
    • denoising
    • fine-tuning
    • masked-language-model
  • Apr 18, 2026

    Code Generation

    • concept
    • code
    • llm
    • fine-tuning
  • Apr 18, 2026

    Evaluating Large Language Models Trained on Code (Codex)

    • source
    • code-generation
    • pre-training
    • fine-tuning
    • sampling
    • scaling-laws
  • Apr 17, 2026

    Instruction Following

    • concept
    • alignment
    • sft
    • fine-tuning
  • Apr 17, 2026

    QLoRA: Efficient Finetuning of Quantized LLMs

    • source
    • quantization
    • lora
    • fine-tuning
    • memory-efficiency
    • inference-efficiency
  • Apr 17, 2026

    Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer (T5)

    • source
    • transfer-learning
    • pre-training
    • encoder-decoder
    • fine-tuning
    • scaling-laws
    • nlp
  • Apr 11, 2026

    Transfer Learning

    • concept
    • training
    • fine-tuning
  • Apr 10, 2026

    BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

    • source
    • bert
    • pretraining
    • bidirectional
    • nlp
    • fine-tuning
    • masked-lm
  • Apr 10, 2026

    Training language models to follow instructions with human feedback (InstructGPT)

    • source
    • alignment
    • rlhf
    • llm
    • fine-tuning
    • safety
  • Apr 05, 2026

    LoRA (Low-Rank Adaptation)

    • concept
    • fine-tuning
    • peft
  • Apr 05, 2026

    LoRA: Low-Rank Adaptation of Large Language Models

    • source
    • fine-tuning
    • lora
    • peft
    • adaptation