AI avatar video script formatter
Avatar voices read text very literally, so a script that looks fine on the page can sound rushed and robotic on screen. This formatter prepares your script for HeyGen or Synthesia: it inserts pause markers, flags sentences that are too long, surfaces proper nouns that may need a pronunciation guide, and estimates the finished duration.
How it works
The tool processes your text in a few passes:
- Pause insertion — it adds a break marker (
<break>for HeyGen, or a pause hint for Synthesia) between paragraphs and after sentence-final punctuation so delivery has natural rhythm. - Sentence-length check — any sentence over ~25 words is flagged, because long runs make avatars sound breathless.
- Proper-noun scan — capitalised words that appear mid-sentence are listed as pronunciation candidates; these are where TTS most often slips.
- Duration estimate — word count ÷ ~140 wpm gives an approximate runtime.
Tips for natural avatar narration
- Write for the ear, not the page — short declarative sentences read best.
- Spell out tricky names phonetically the first time, e.g. “Gera (GEH-ra)”.
- Keep one idea per sentence so pauses land where the meaning breaks.
- Read the formatted output aloud before rendering — if you stumble, the avatar will too.