Question 1

What does an LLM actually do under the hood?

Accepted Answer

At its core an LLM predicts the next token given the tokens so far. It outputs a probability for every token in its vocabulary, one token is sampled from that distribution and appended to the text, and the process repeats. Everything that looks like reasoning, writing, or answering is built on that single repeated guess.
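The loop described above can be sketched in a few lines. This is only an illustration: TOY_MODEL is an invented lookup table that conditions on the last token alone, whereas a real LLM conditions on the whole context and scores a vocabulary of many thousands of tokens. The names next_token_probs and generate are hypothetical.

```python
import random

# Hypothetical toy "model": maps the last token to next-token probabilities.
# The numbers are invented for illustration only.
TOY_MODEL = {
    "<start>": {"the": 1.0},
    "the": {"cat": 0.6, "dog": 0.4},
    "cat": {"sat": 0.7, "ran": 0.3},
    "dog": {"sat": 0.5, "ran": 0.5},
    "sat": {"<end>": 1.0},
    "ran": {"<end>": 1.0},
}


def next_token_probs(tokens):
    """Return a probability for every candidate next token."""
    return TOY_MODEL[tokens[-1]]


def generate(max_tokens=10, seed=0):
    """Repeat: get probabilities, pick one token, append it."""
    rng = random.Random(seed)
    tokens = ["<start>"]
    for _ in range(max_tokens):
        probs = next_token_probs(tokens)
        candidates = list(probs)
        weights = [probs[t] for t in candidates]
        # Sample one token in proportion to its probability.
        choice = rng.choices(candidates, weights=weights, k=1)[0]
        if choice == "<end>":
            break
        tokens.append(choice)
    return tokens[1:]  # drop the <start> marker


print(" ".join(generate()))
```

Changing the seed can change the sentence, because the choice at each step is a weighted random draw rather than a fixed lookup.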

Question 2

What is the difference between pre-training and fine-tuning?

Accepted Answer

Pre-training is the expensive phase where the model learns language and world knowledge by predicting next tokens across trillions of tokens of text. Fine-tuning, including RLHF, is a much smaller follow-up phase that shapes the model to be helpful, follow instructions, and stay safe. It mainly changes how the model behaves rather than adding much new knowledge.
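To make "learning by predicting next tokens" concrete, here is a minimal sketch of the per-token score usually used for this kind of training, the negative log of the probability the model gave to the token that actually came next (cross-entropy). The function name and the example probabilities are assumptions made up for illustration, not taken from any particular model.

```python
import math


def next_token_loss(predicted_probs, actual_next_token):
    """Negative log-probability assigned to the token that really came next.

    A low value means the model expected this token; a high value means
    it was surprised. Training nudges the model to lower this number.
    """
    return -math.log(predicted_probs[actual_next_token])


# Invented predictions for the text "the cat ..." where the real next
# token is "sat".
confident = {"sat": 0.9, "ran": 0.1}
unsure = {"sat": 0.1, "ran": 0.9}

print(next_token_loss(confident, "sat"))
print(next_token_loss(unsure, "sat"))
```

The first loss is much smaller than the second, which is the signal that pushes the model toward predictions that match the training text.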

Question 3

Why do language models hallucinate?

Accepted Answer

Because the model generates the most plausible-sounding next token, not the most factual one. On its own it has no database to check against, so when it lacks knowledge it fills the gap with fluent, confident text that fits the pattern. Hallucination is the same mechanism that makes it fluent, applied where it lacks grounding.

Question 4

What is a context window and why does it matter?

Accepted Answer

The context window is the maximum amount of text, measured in tokens, that the model can consider at once, counting both the prompt and the generated output. Anything outside it is invisible to the model. A larger window lets you feed in more documents or longer conversations, but in a typical chat setup the whole conversation so far is sent again and reprocessed on each turn, so cost and response time grow with the amount of context used.
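One common way applications cope with the limit is to drop the oldest messages until the rest fits. The sketch below assumes that strategy; fit_to_window and rough_token_count are hypothetical helpers, and counting words is only a crude stand-in for a real tokenizer, which splits text into subword pieces.

```python
def fit_to_window(messages, window_size, reserved_for_output, count_tokens):
    """Keep the most recent messages that fit in the prompt budget.

    The window covers prompt plus output, so room for the reply is
    reserved before any prompt text is admitted.
    """
    budget = window_size - reserved_for_output
    kept = []
    used = 0
    # Walk from newest to oldest, stopping at the first message that
    # would overflow the budget.
    for message in reversed(messages):
        cost = count_tokens(message)
        if used + cost > budget:
            break
        kept.append(message)
        used += cost
    kept.reverse()  # restore chronological order
    return kept, used


def rough_token_count(text):
    # Assumption for illustration: one token per whitespace-separated word.
    return len(text.split())


history = [
    "my name is Ada",
    "what is a context window",
    "and why does it matter",
]
kept, used = fit_to_window(
    history,
    window_size=16,
    reserved_for_output=4,
    count_tokens=rough_token_count,
)
print(kept, used)
```

With these numbers the oldest message no longer fits, so the model would never see the name it contained. That is what "invisible to the model" means in practice.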

Question 5

What does temperature control?

Accepted Answer

Temperature controls how random the next-token choice is. Low temperature concentrates probability on the most likely tokens, giving focused, more repeatable output that suits facts and code. High temperature flattens the probabilities so less likely tokens get picked more often, giving more varied and creative output at the cost of reliability.
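The effect is easy to see numerically. In the usual formulation, the model's raw scores (logits) are divided by the temperature before being turned into probabilities with softmax. The sketch below assumes that formulation; the logits are invented, and softmax_with_temperature is a hypothetical helper name.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities, sharpened or flattened by temperature."""
    if temperature <= 0:
        # Many systems treat temperature 0 as "always pick the top token";
        # this sketch simply rejects it to avoid dividing by zero.
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.0]  # invented scores for three candidate tokens
for t in (0.2, 1.0, 2.0):
    probs = softmax_with_temperature(logits, t)
    print(t, [round(p, 3) for p in probs])
```

At the lowest temperature nearly all the probability sits on the top-scoring token, while at the highest the three candidates are much closer together.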

How Large Language Models Work: A Plain-English Explanation

It is all next-token prediction

How it learned: pre-training then alignment

Context windows, temperature, and other dials

Why they hallucinate