OpenAI TTS cost calculator
OpenAI’s speech API charges per character of input text, so the cost of an audiobook or a podcast intro comes down to script length and which model you pick. This calculator converts your word or character count into a USD estimate for both tts-1 and tts-1-hd, plus the expected audio duration.
How it works
The math is straightforward:
characters = words × 6 (≈ chars per English word)
cost = characters × price_per_char × quantity
duration = words / 150 wpm (normal speaking pace)
tts-1 is the budget, lower-latency model; tts-1-hd costs more per character for higher fidelity. The calculator shows both so you can see exactly what the quality upgrade costs for your volume.
Tips for controlling TTS spend
- Edit before you synthesize. You pay for every character of input — tighten the script first.
- Reserve HD for published audio. Use tts-1 for drafts, previews, and real-time agents; switch to tts-1-hd only for the final render.
- Reuse static segments. Generate fixed intros and disclaimers once and splice them in rather than re-synthesizing each run.
- Batch and cache. Identical inputs produce billable calls every time — cache the output audio, not just the text.