A caption should stay on screen long enough to read without effort: about 1 second minimum, up to roughly 6 seconds maximum, timed to how long the words are spoken. The classic guideline is a reading speed around 15–17 characters per second. In animated short-form captions, words appear and clear individually, so no single line ever lingers.
The reading-speed rule
Caption timing comes down to one thing: can a viewer finish reading the line before it disappears? The long-standing broadcast convention is about 15–17 characters per second — comfortable for most adults. A short line of 5–7 words fits in roughly 2–3 seconds at that pace.
Two guardrails bracket it: a caption should stay up for at least about a second even if the phrase is tiny (so it doesn't flash), and no longer than about six seconds (past that it feels stuck and the sync drifts from the audio).
Short-form plays by different math
Traditional subtitle timing assumes full sentences on screen. But the word-by-word style that dominates TikTok and Reels works differently: each word appears exactly when it's spoken and clears a beat later. There's no sentence to read — just a word to recognize — so the captions can move as fast as the speech without losing the viewer.
That's why fast-cut captions look almost frantic but still feel easy to follow. The timing isn't about giving you time to read a block; it's about keeping the on-screen word locked to the voice.
Get word-level caption timing synced to your audio automatically.
Try it freeTiming mistakes that cost you viewers
- Captions that lag the audio. If the text trails what's being said, it reads as broken. Word-level sync keeps them locked together.
- Lines that flash too fast. A full phrase held for under a second can't be read. Give phrase-style lines at least a second.
- Lines that overstay. A caption sitting long after the words were spoken feels stuck and makes the video look unedited.
- Manual keyframing everything. Timing captions by hand is slow and error-prone; automatic sync gets you 95% of the way in seconds.
How to get the timing right automatically
You shouldn't be counting characters per second on every line. When you upload a clip to AnimateCaptions, it transcribes with word-level timestamps and syncs each caption to exactly when it's spoken — so the timing is right by default. Adjust anything you want, then export a burned-in MP4 with the pacing locked in.
