LLM API Cost Calculator

Estimate the cost of any GPT, Claude or Gemini API call — per request and per month.


LLM API cost calculator — per request and per month

Estimate what an OpenAI, Anthropic or Google AI API call will actually cost before you ship. Pick a model, enter the input and output tokens for a typical request, add your monthly volume, and see the cost per call and the projected monthly bill update instantly.

How LLM pricing works

Large language models bill per token, and almost always price input (your prompt) and output (the model’s completion) at different rates: the output rate is typically 3–5× the input rate. Prices are quoted in dollars per million tokens, so the cost of one request is:

cost = (input_tokens / 1,000,000) × input_price
     + (output_tokens / 1,000,000) × output_price
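
As a minimal sketch, the formula above can be written as a small Python function and extended to a monthly projection. The function names and the two prices are assumptions made for illustration; the prices are not any provider's real rates, so substitute the figures from the pricing page of the model you use.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Cost of one request in USD. Prices are USD per million tokens."""
    return ((input_tokens / 1_000_000) * input_price
            + (output_tokens / 1_000_000) * output_price)


def monthly_cost(per_request: float, requests_per_month: int) -> float:
    """Projected monthly bill, assuming every request looks like the typical one."""
    return per_request * requests_per_month


# Hypothetical prices, for illustration only (USD per million tokens).
INPUT_PRICE = 3.00
OUTPUT_PRICE = 15.00

per_call = request_cost(2_000, 500, INPUT_PRICE, OUTPUT_PRICE)
per_month = monthly_cost(per_call, 100_000)

print(f"per call:  ${per_call:.4f}")     # per call:  $0.0135
print(f"per month: ${per_month:,.2f}")   # per month: $1,350.00
```

In this example the 500 output tokens cost more than the 2,000 input tokens, which is why output length tends to dominate the bill.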

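A token is roughly ¾ of an English word, so 1,000 tokens ≈ 750 words. Treat this as a rule of thumb: the exact count depends on the model's tokenizer, and code or non-English text usually needs more tokens per word.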
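
If you only have the text and not a token count, the ¾-word rule of thumb gives a rough figure to plug into the calculator. The sketch below assumes whitespace-separated English words and a hypothetical helper name, estimate_tokens; it is an approximation, not a tokenizer, so use the provider's own token counter when accuracy matters.

```python
import math


def estimate_tokens(text: str) -> int:
    """Rough token estimate: 1 token ~ 3/4 of a word, so tokens ~ words * 4/3."""
    words = len(text.split())
    # Round up so the estimate errs on the expensive side.
    return math.ceil(words * 4 / 3)


prompt = "Summarize the following support ticket in two sentences."
print(estimate_tokens(prompt))
```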

Tips to cut your bill

  • Trim the prompt. Long system prompts are billed on every call, so shorten them or move stable context into a cached form.
  • Cap output. Set max_tokens so a runaway completion can’t 10× your cost.
  • Right-size the model. A mini/flash model is often 10–20× cheaper and good enough for classification, extraction and routing.
  • Batch and cache. Prompt caching and batch APIs can cut input costs sharply at volume.
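
Two of these tips are easy to quantify before you ship: the worst-case cost under an output cap, and the saving from a smaller model. The sketch below assumes two hypothetical models with made-up prices (USD per million tokens) and treats the output cap as a hard upper bound on billed output tokens; the model names, prices and helper names are illustrative only.

```python
# Hypothetical models and prices (USD per million tokens), for illustration only.
PRICES = {
    "large-model": {"input": 3.00, "output": 15.00},
    "mini-model": {"input": 0.30, "output": 1.50},
}


def request_cost(input_tokens: int, output_tokens: int, model: str) -> float:
    """Cost of one request in USD for the given (hypothetical) model."""
    price = PRICES[model]
    return ((input_tokens / 1_000_000) * price["input"]
            + (output_tokens / 1_000_000) * price["output"])


def worst_case_cost(input_tokens: int, max_tokens: int, model: str) -> float:
    """Upper bound on one request: assume the completion uses the full output cap."""
    return request_cost(input_tokens, max_tokens, model)


# Same prompt and cap, two model sizes.
for model in PRICES:
    bound = worst_case_cost(1_500, 400, model)
    print(f"{model}: at most ${bound:.5f} per call")
```

Multiplying the bound by your monthly volume gives a ceiling on the bill, which is usually a safer planning number than the average case.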