What It Is

A training paradigm where a model learns representations from unlabeled data by creating its own supervisory signal — predicting one part of the data from another, or recognizing that two views of the same data should agree.

Why It Matters

Labels are expensive; unlabeled data is abundant. Self-supervised learning lets models build rich, transferable representations from raw data at scale, often matching or exceeding supervised pretraining when fine-tuned on small labeled sets.

How It Works

The model is given a pretext task constructed from the data itself: reconstruct masked patches, predict the next token, or produce matching representations for two augmented views of the same image. The labels come from the structure of the data, not from human annotation. After pretraining, the learned representations transfer to downstream tasks via fine-tuning or linear probing.
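The "labels come from the data structure" idea can be made concrete with a minimal sketch of masked-token example construction (the function name and token set here are illustrative, not from any particular library): the target at each masked position is simply the original token, so supervision is generated without any human annotation.

```python
import random

def make_masked_examples(tokens, mask_rate=0.15, mask_token="[MASK]", seed=0):
    """Turn a raw token sequence into (input, target) training pairs.

    The supervisory signal is self-generated: wherever a token is
    replaced by mask_token, the prediction target is the original
    token at that position. No human labels are involved.
    """
    rng = random.Random(seed)
    inputs = list(tokens)
    targets = {}  # position -> original token to predict
    for i, tok in enumerate(tokens):
        if rng.random() < mask_rate:
            inputs[i] = mask_token
            targets[i] = tok  # the "label" comes from the data itself
    return inputs, targets

tokens = "the cat sat on the mat".split()
inputs, targets = make_masked_examples(tokens, mask_rate=0.5)
```

A model pretrained on pairs like these (e.g. with a cross-entropy loss over the vocabulary at each masked position) never sees an annotated example, yet must learn context-sensitive representations to fill the blanks.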

Key Sources