ML Wiki
Search
Search
Explorer
Tag: ai-feedback
2 items with this tag.
May 07, 2026
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
source
rag
retrieval
in-context-learning
instruction-following
ai-feedback
long-context
May 05, 2026
Self-Rewarding Language Models
source
alignment
rlhf
dpo
reward-model
sft
ai-feedback