Question 1

What is chain-of-thought prompting?

Accepted Answer

Chain-of-thought (CoT) prompting asks the model to work through a problem step by step before giving a final answer, rather than jumping straight to a conclusion. Generating intermediate reasoning improves accuracy on tasks involving arithmetic, logic, and multi-step problems because the model effectively allocates more computation to thinking and is less likely to skip a step.

Question 2

What is the difference between zero-shot and few-shot CoT?

Accepted Answer

Zero-shot CoT simply appends an instruction like 'Let's think step by step' to your prompt, prompting reasoning without examples. Few-shot CoT instead includes a couple of worked examples that show the full reasoning before each answer, teaching the model the reasoning style you want. Few-shot is more reliable but costs more tokens; zero-shot is nearly free and often enough on modern models.

Question 3

Does chain-of-thought work on reasoning models like o1?

Accepted Answer

Reasoning models such as OpenAI's o-series already perform extended internal reasoning automatically, so adding 'think step by step' adds little and can even hurt by interfering. CoT prompting is most valuable on standard chat models that answer in a single pass. For reasoning models, give a clear problem statement and let the model do its own thinking.

Question 4

What is self-consistency?

Accepted Answer

Self-consistency runs chain-of-thought several times with some randomness, producing multiple independent reasoning paths, then takes the most common final answer by majority vote. Because different paths can reach the same correct answer while errors vary, the majority answer is usually more reliable than any single run. It improves accuracy at the cost of running the prompt multiple times.

Chain-of-Thought Prompting: Complete Guide with Examples

Why thinking out loud makes models smarter

Zero-shot CoT: the cheapest win

Few-shot CoT and self-consistency

Tree-of-thought, limits, and when not to use CoT