Question 1

What does LLM stand for?

Accepted Answer

LLM stands for Large Language Model. It is a neural network — almost always a transformer — trained on a very large corpus of text to predict the next token, and "large" refers both to the size of the training data and to the number of parameters, typically billions or more.

Question 2

How is an LLM different from older language models?

Accepted Answer

Earlier language models were small and narrow, trained for a single task such as sentiment classification or translation. An LLM is trained on broad web-scale text with one generic objective, which lets a single model handle many tasks it was never explicitly trained for, from summarising to coding to answering questions.

Question 3

What are emergent capabilities?

Accepted Answer

Emergent capabilities are abilities that appear abruptly once a model passes a certain size or training threshold, rather than improving smoothly. Examples include multi-step arithmetic, following few-shot instructions, and chain-of-thought reasoning, none of which the model was directly trained to do.

Question 4

Does an LLM understand language the way humans do?

Accepted Answer

No. An LLM has no grounded experience of the world; it models statistical patterns in text. It can produce fluent, useful, and often correct output, but it has no beliefs or intentions and can state false claims confidently, which is why its output should be verified for anything important.

LLM — Large Language Model (AI Glossary)

What an LLM is

How it is trained

Why scale matters: emergent capabilities

What LLMs can and cannot do

Where the term sits