Question 1

What is top-p (nucleus) sampling?

Accepted Answer

Top-p sampling keeps the smallest group of the most likely next tokens whose probabilities add up to a threshold P, then samples from that group. The size of the group changes from step to step depending on how confident the model is.

Question 2

How is top-p different from top-k?

Accepted Answer

Top-k always keeps a fixed number of candidates, while top-p keeps a variable number defined by a probability mass. When the model is confident, top-p narrows to a few tokens; when it is uncertain, it widens to include more, making it more adaptive.

Question 3

What is a good top-p value?

Accepted Answer

A common default is around 0.9 to 0.95, which preserves natural variety while clipping off the long tail of unlikely tokens. Lower values like 0.5 make output more focused and conservative; 1.0 effectively disables the cut-off.

Question 4

Should I change both top-p and temperature?

Accepted Answer

Usually not at the same time. Both control randomness and their effects compound, making behaviour hard to predict. The common practice is to pick one as your primary dial and leave the other at its default value.

Top-P / Nucleus Sampling (AI Glossary)

Definition

How it works step by step

Why adaptivity matters

Choosing a value

Top-p with temperature