ML Wiki

Tag: thread

5 items with this tag.

  • Apr 28, 2026

    Are emergent abilities real or a metric artifact?

    • thread
    • scaling
    • emergent-abilities
    • evals
  • Apr 28, 2026

    Does DPO scale reliably past 70B?

    • thread
    • alignment
    • scaling
  • Apr 28, 2026

    What's the right alignment stack post-RLHF?

    • thread
    • alignment
    • rlhf
    • dpo
  • Apr 28, 2026

    When does long context actually fail?

    • thread
    • long-context
    • evals
  • Apr 28, 2026

    Where does RL-on-verifiable-rewards stop generalizing?

    • thread
    • reasoning
    • rl
    • deepseek-r1