Question 1

What is self-supervised learning?

Accepted Answer

Self-supervised learning creates training labels automatically from the structure of unlabelled data, for example by hiding part of the input and training the model to predict it. This lets models learn from huge amounts of raw text or images without any human annotation.
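
To make "hiding part of the input and predicting it" concrete, here is a minimal sketch in plain Python. The helper name make_masked_example, the "[MASK]" placeholder string, and the whitespace tokenisation are assumptions made for illustration; real systems use subword tokenisers and mask many positions at once.

```python
import random

MASK = "[MASK]"  # placeholder string chosen for this sketch


def make_masked_example(sentence, rng):
    """Hide one word of the sentence and return (masked_input, hidden_word).

    The label (hidden_word) comes from the sentence itself, so no human
    annotation is needed.
    """
    words = sentence.split()  # naive whitespace tokenisation (assumption)
    idx = rng.randrange(len(words))  # position to hide
    target = words[idx]
    masked = words[:idx] + [MASK] + words[idx + 1:]
    return " ".join(masked), target


rng = random.Random(0)  # fixed seed so the run is repeatable
masked_input, label = make_masked_example("the cat sat on the mat", rng)
print(masked_input, "->", label)
```

Every raw sentence yields a training pair this way, which is why the amount of usable data is limited by the size of the corpus rather than by labelling effort.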

Question 2

How is self-supervised learning different from unsupervised learning?

Accepted Answer

Both use unlabelled data, but self-supervised learning invents a supervised-style prediction task from the data itself, like filling in a missing word. Classic unsupervised methods, such as clustering or dimensionality reduction, instead group or compress data without predicting a target.
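
The contrast can be sketched on a toy dataset. Everything below is illustrative: the rows, the function names, and the crude mean-threshold split (a stand-in for a real clustering algorithm such as k-means) are assumptions, not a standard recipe.

```python
# Unlabelled rows of numbers; nobody has annotated them.
rows = [[1.0, 2.0, 3.0], [2.0, 4.0, 6.0], [10.0, 20.0, 30.0]]


def self_supervised_pairs(rows):
    """Self-supervised framing: hide the last value of each row and use it
    as the prediction target, giving (input, target) pairs."""
    return [(row[:-1], row[-1]) for row in rows]


def two_group_split(rows):
    """Unsupervised framing: assign each row to one of two groups by
    comparing its sum with the mean sum. No target is predicted."""
    sums = [sum(row) for row in rows]
    mean = sum(sums) / len(sums)
    return [int(s > mean) for s in sums]


pairs = self_supervised_pairs(rows)  # data for a supervised-style loss
groups = two_group_split(rows)       # group ids only, nothing to predict
print(pairs)
print(groups)
```

The first function produces inputs with targets that a model can be scored against; the second only describes structure in the data.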

Question 3

What is a pretext task?

Accepted Answer

A pretext task is a made-up prediction problem whose answer comes free from the data, such as predicting the next token or a masked word. Solving the pretext task forces the model to learn general representations that transfer to real downstream tasks.
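
As a rough illustration of next-token prediction as a pretext task, the sketch below turns one token sequence into (context, next token) training pairs. The function name next_token_pairs is hypothetical, and the tokens are whole words for readability; real models work on subword tokens and process all positions in parallel.

```python
def next_token_pairs(tokens):
    """Return one (context, next_token) pair per position after the first.

    The target at each step is simply the token that follows in the data,
    so the labels cost nothing to produce.
    """
    return [(tokens[:i], tokens[i]) for i in range(1, len(tokens))]


pairs = next_token_pairs(["the", "cat", "sat"])
for context, target in pairs:
    print(context, "->", target)
# ['the'] -> cat
# ['the', 'cat'] -> sat
```

A sequence of n tokens gives n - 1 training examples, all derived from the raw text.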

Question 4

Why is self-supervised learning so important for large models?

Accepted Answer

It removes the bottleneck of human labelling, so models can train on web-scale corpora instead of small curated datasets. That scale of data is a large part of what gives modern foundation models like GPT and BERT their broad knowledge and capabilities.

What Is Self-Supervised Learning?

What self-supervised learning is

Pretext tasks: free labels from raw data

Next-token prediction (GPT)

Masked prediction (BERT)

Contrastive learning (CLIP)

Why it changed AI