Chain-of-thought token cost calculator
Chain-of-thought (CoT) prompting tells the model to reason step by step before giving its final answer. Those reasoning steps are real output tokens, and output tokens are typically the priciest you buy. This calculator shows exactly how much CoT inflates your bill versus a direct answer, so you can spend on reasoning only where it earns its keep.
How it works
You enter the tokens in a direct answer and the extra reasoning tokens that CoT adds before that answer. Both are priced at the selected model’s output rate. The calculator reports the cost of each mode per call, the overhead in dollars, the cost multiplier (how many times more CoT costs), and — at your daily request volume — the additional dollars you spend per day for the reasoning.
Tips and notes
- Reasoning is output, not input. That is why CoT hurts: output tokens often cost 3-4x input tokens, so verbose reasoning compounds fast.
- Target it. Reserve CoT for genuinely multi-step problems; gate it behind a task classifier so simple requests stay cheap.
- Cap the reasoning. Instructing the model to reason concisely, or limiting output tokens, keeps the overhead bounded.
- Estimates only. Reasoning length varies per request; use a realistic average and confirm pricing in your provider dashboard.