ML Wiki

Tag: ai-feedback

2 items with this tag.

  • May 07, 2026

    Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

    • source
    • rag
    • retrieval
    • in-context-learning
    • instruction-following
    • ai-feedback
    • long-context
  • May 05, 2026

    Self-Rewarding Language Models

    • source
    • alignment
    • rlhf
    • dpo
    • reward-model
    • sft
    • ai-feedback