Question 1

What is a foundation model?

Accepted Answer

A foundation model is a large model trained on broad, general data that can be adapted to many different downstream tasks. Rather than building a separate model per task, you start from one powerful base and specialise it.

Question 2

How is a foundation model trained?

Accepted Answer

It is pre-trained on enormous amounts of unlabelled data using self-supervised objectives like next-token prediction. This single, expensive pre-training run produces general-purpose capabilities that many applications can reuse.

Question 3

What are emergent capabilities?

Accepted Answer

Emergent capabilities are skills that appear only once a model passes a certain scale and were not present in smaller versions, such as multi-step reasoning or in-context learning. They are a key reason scaling foundation models has been so impactful.

Question 4

What is the difference between a foundation model and a large language model?

Accepted Answer

A large language model is a foundation model whose data is text, while foundation models more broadly can also cover images, audio, code or multiple modalities at once. Every LLM is a foundation model, but not every foundation model is text-only.

What Is a Foundation Model?

What a foundation model is

Pre-training: learn from everything

Adaptation: specialise for anything

Emergent capabilities at scale

Why foundation models matter