ML Wiki

Tag: safety

4 items with this tag.

Apr 17, 2026
Constitutional AI (CAI)
Apr 17, 2026
Harmlessness (AI alignment)
Apr 17, 2026
Constitutional AI: Teaching Models to Self-Correct
Apr 10, 2026
Training language models to follow instructions with human feedback (InstructGPT)