Question 1

What is a reasoning model?

Accepted Answer

A reasoning model is an LLM trained to spend extra computation 'thinking' — generating a long internal chain of reasoning — before producing a final answer. This improves accuracy on hard, multi-step problems like maths, coding and logic.

Question 2

How are reasoning models different from standard LLMs?

Accepted Answer

Standard LLMs answer in roughly one pass. Reasoning models use test-time compute: they produce hidden reasoning tokens first, effectively trading speed and cost for higher accuracy on difficult tasks.

Question 3

Are reasoning models always better?

Accepted Answer

No. They are slower and more expensive, and the extra thinking adds little on simple tasks. Use them for hard reasoning, maths, planning and complex code; use standard models for everyday chat, drafting and retrieval.

Question 4

Do I pay for the hidden thinking tokens?

Accepted Answer

Usually yes. Providers typically bill the internal reasoning tokens as output even though you do not see them, which is why reasoning calls can cost noticeably more.

Use a reasoning model for	Use a standard model for
Hard maths and proofs	Everyday chat and Q&A
Complex, multi-file coding	Drafting and rewriting
Multi-step planning and analysis	Summarising and extraction
Tricky logic and debugging	High-volume, latency-sensitive tasks

AI Reasoning Models Explained: o1, o3, Gemini Thinking vs Standard LLMs

What is a reasoning model?

How they differ from standard LLMs

Test-time compute, briefly

When to use a reasoning model

The cost trade-off