LLM API Cost Estimator

Estimate monthly LLM API costs from usage volume and model selection.

Ad placeholder (leaderboard)

Before you ship an LLM feature, you need to know what it will cost at scale. This estimator turns daily volume and average token sizes into per-request, daily and monthly cost — and compares every major model side by side so you can pick the cheapest one that meets your quality bar.

How it works

Cost per request is computed from each model’s published per-million-token rates:

cost = (prompt_tokens / 1M × input_price) + (completion_tokens / 1M × output_price)

That is multiplied by your daily request count for the daily figure and by ~30 for the monthly figure. Output tokens are priced several times higher than input tokens on most models, so verbose completions dominate the bill. All math runs locally in your browser.

Tips

Use realistic average token counts, not best-case ones — about four characters per token for English text. The model comparison is the highest-leverage part: the same workload can cost 10× more on a frontier model than a small one, so routing easy requests to a cheaper model is often the biggest saving available. Note these figures assume uncached pricing; if you reuse a stable system prompt, prompt caching can cut input costs substantially, making this an upper-bound estimate.

Ad placeholder (rectangle)