OpenAI TTS voice picker
This picker covers six voices from OpenAI’s speech models: alloy, echo, fable, nova, onyx, and shimmer. The right one depends on what you are making. The picker pairs each voice with its tone, best use cases, and a sample phrase, then recommends a shortlist based on your use case and tone preference.
How it works
Every voice has a consistent character regardless of model. The picker scores each one against two inputs:
- Use case — narration, assistant/agent, audiobook, advertisement, or character — because a calm reader and a punchy promo voice rarely overlap.
- Tone — warm, neutral, or authoritative — which biases toward the voices that carry that feel.
It then surfaces the top matches with notes on speed (the API’s speed parameter, 0.25–4.0) and audio format so you can drop the choice straight into your request.
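The scoring step can be sketched roughly as follows. This is a minimal illustration, not the picker's actual implementation: the trait sets assigned to each voice, the weights, and the names VoiceProfile, score, and shortlist are all assumptions made up for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceProfile:
    name: str
    use_cases: frozenset
    tones: frozenset


# Illustrative trait data only: these assignments are assumptions for the
# sketch, not characteristics published by OpenAI.
PROFILES = [
    VoiceProfile("alloy", frozenset({"narration", "audiobook"}), frozenset({"neutral"})),
    VoiceProfile("echo", frozenset({"assistant", "narration"}), frozenset({"neutral"})),
    VoiceProfile("fable", frozenset({"character", "audiobook"}), frozenset({"warm"})),
    VoiceProfile("nova", frozenset({"assistant", "advertisement"}), frozenset({"warm"})),
    VoiceProfile("onyx", frozenset({"narration", "audiobook"}), frozenset({"authoritative"})),
    VoiceProfile("shimmer", frozenset({"assistant", "advertisement"}), frozenset({"warm"})),
]

# Hypothetical weights: a use-case match counts for more than a tone match.
USE_CASE_WEIGHT = 2
TONE_WEIGHT = 1


def score(profile: VoiceProfile, use_case: str, tone: str) -> int:
    """Add up the weights of the inputs that this voice matches."""
    total = 0
    if use_case in profile.use_cases:
        total += USE_CASE_WEIGHT
    if tone in profile.tones:
        total += TONE_WEIGHT
    return total


def shortlist(use_case: str, tone: str, top_n: int = 3) -> list:
    """Return the names of the top_n voices, best score first."""
    ranked = sorted(PROFILES, key=lambda p: score(p, use_case, tone), reverse=True)
    return [p.name for p in ranked[:top_n]]


print(shortlist("narration", "authoritative"))
```

Weighting use case above tone reflects the idea that the medium narrows the field first and tone breaks ties; a real picker could just as reasonably use graded scores per trait instead of set membership.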
Tips for choosing a voice
- Audition with your real script. A generic “hello world” hides how a voice handles your actual sentences, names, and pacing.
- Match voice to medium. Onyx and alloy read long-form well; nova and shimmer feel friendlier for assistants and welcome flows.
- Set speed deliberately. Drop to ~0.9 for instructions and push to ~1.1–1.2 for snappy summaries; 1.0 is the default.
- Pick format by destination. mp3/aac to ship, opus to stream, wav/flac when you will edit the audio afterward.
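As a rough sketch of how the speed and format tips carry into a request, the helper below assembles the request fields and rejects a speed outside 0.25–4.0. The function name build_speech_request, the destination labels ("ship", "stream", "edit"), and the default model "tts-1" are assumptions chosen for illustration; the dictionary keys follow the speech endpoint's model, voice, input, speed, and response_format fields.

```python
SPEED_MIN, SPEED_MAX = 0.25, 4.0

# Hypothetical destination labels mapped to one format each, following the
# tip above (aac and flac would be equally valid picks for "ship" and "edit").
FORMAT_BY_DESTINATION = {
    "ship": "mp3",
    "stream": "opus",
    "edit": "wav",
}


def build_speech_request(text, voice, destination="ship", speed=1.0, model="tts-1"):
    """Assemble the request fields for a text-to-speech call."""
    if not SPEED_MIN <= speed <= SPEED_MAX:
        raise ValueError(f"speed must be between {SPEED_MIN} and {SPEED_MAX}, got {speed}")
    if destination not in FORMAT_BY_DESTINATION:
        raise ValueError(f"unknown destination: {destination!r}")
    return {
        "model": model,
        "voice": voice,
        "input": text,
        "speed": speed,
        "response_format": FORMAT_BY_DESTINATION[destination],
    }


# Slightly slower delivery for step-by-step instructions, encoded for streaming.
request = build_speech_request(
    "Press and hold the power button for three seconds.",
    voice="nova",
    destination="stream",
    speed=0.9,
)
print(request)
```

With the official Python SDK installed and an API key configured, a dictionary like this should be usable as client.audio.speech.create(**request); that call is left out here so the sketch runs without network access.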