How to add captions that actually help your edit
6 min read
Captions can lift an edit or quietly drag it down, and most editors never think about which one theirs are doing. Text on screen is one of the eight things Inkroy scores, because it is a real lever: good captions guide attention and add punch, bad ones cover the action, distract, or just sit there as noise. Here is how to use them well.
What captions are actually for
A caption should do a job. Before you add one, know which:
- Set up a moment. A line of text that builds anticipation for what is about to happen ("wait for it," a question, a claim the edit pays off).
- Name or frame the content. Telling the viewer what they are looking at when it adds context.
- Carry the hook. Often the strongest hook is a line of text in the first second that opens a loop.
- Add rhythm or humor. Text that punches in on a beat, or lands a joke at the right second.
If a caption is not doing one of those jobs, it is probably working against you.
The rules for captions that help
Keep them short and readable in a glance
People watch fast. A caption has to be readable in the moment it is on screen. Long sentences get skipped. Cut to the fewest words that do the job.
Time them to the edit
A caption should appear and leave with intention, ideally on the beat, like any other element. Text that lingers after its moment, or shows up late, feels sloppy. Punch it in and take it out.
Do not cover the action
The most common caption mistake: text sitting right over the part of the frame people want to watch. Keep captions clear of the subject and the action. Use the safe zones, away from where platform UI (the caption, the buttons, the username) covers the screen.
Make them legible
Enough contrast to read instantly. A subtle stroke or shadow so text stays readable over busy footage. If your viewer has to squint, the caption failed.
Match the vibe
The font and style should fit the edit. A clean minimal font for a smooth edit; bolder, punchier text for a hype one. Mismatched text style is a small thing that reads as amateur.
When to use no captions at all
Plenty of great edits have zero text. If your edit communicates everything through footage, music, and pacing, captions can just be clutter. The test: turn the captions off in your head. Is the edit worse without them? If not, cut them. Text for the sake of text is noise.
Lyrics are the classic trap here. Putting the song's lyrics on screen feels like "doing something," but unless the lyrics are part of the point, they often just cover the footage and add nothing. Be honest about whether they help.
How captions tie into the rest of the edit
Captions interact with the hook (a text hook is one of the strongest openers, see the 3-second hook) and with pacing (timing text to the beat, see beat syncing). They are not a separate layer you bolt on at the end; they are part of the edit's rhythm.
How to check yours
- Watch with the captions on, fast, once. Could you read each one in time? Did any cover something you wanted to see?
- Watch with them off. Is the edit clearly worse? If not, you do not need them.
- Check the safe zones. Preview where the platform's own UI sits and make sure your text is not hidden behind it.
When you want to know whether your captions are adding or distracting, Inkroy scores Captions as its own dimension and tells you, before you post. Your first analysis is free.