Emergent Behavior (LLMs)

What It Is

Emergent behaviors are capabilities that appear in large models but are absent in smaller ones — and cannot be predicted by extrapolating from smaller scale. Performance is flat (near-random) for many orders of magnitude of compute, then suddenly jumps at a threshold.

Why It Matters

It means scaling is not always smooth. Some capabilities you can’t buy incrementally — you either have them or you don’t. This makes capability prediction hard, and is central to AI safety debates about whether dangerous capabilities could appear suddenly at scale.

Examples

3-digit arithmetic: absent below ~13B params, sharp jump above
Chain-of-thought effectiveness: hurts below ~68B, helps above
Instruction following: hurts below ~8B when fine-tuned, helps above
Multilingual translation: absent in small models, appears at scale

The Controversy

Some researchers argue emergence is a measurement artifact: if you use a finer-grained metric, the improvement is gradual. Others argue the phase transitions are real. The debate is unresolved.

Key Sources

emergent-abilities-of-large-language-models — the paper that defined and catalogued emergence across model families
scaling-laws-for-neural-language-models — establishes the smooth power laws that emergent behavior punctuates
gpt-4-technical-report — reverses inverse scaling on Hindsight Neglect; human-level performance on professional exams as emergent capability
emergent-world-representations-othello-gpt — complementary angle: world models emerge from sequence training even without explicit supervision, evidence of genuine internal structure rather than surface statistics
training-compute-optimal-large-language-models — Chinchilla’s smooth power-law loss curves are evidence for the metric-artifact interpretation: if the underlying capability grows smoothly, sharp task transitions likely reflect nonlinear evaluation rather than genuine phase changes

scaling-laws
in-context-learning
chain-of-thought
grokking — another form of sudden phase transition: generalization appearing long after training loss converges

ML Wiki

Explorer

Emergent Behavior (LLMs)

What It Is

Why It Matters

Examples

The Controversy

Key Sources

Graph View

Table of Contents

Backlinks

ML Wiki

Explorer

Emergent Behavior (LLMs)

What It Is

Why It Matters

Examples

The Controversy

Key Sources

Related Concepts

Graph View

Table of Contents

Backlinks