ES Start a project →
Editing terms · 2026

YouTube editor glossary 2026

30+ essential editing terms every creator and aspiring editor should know. From hook engineering to retention curves to the J-cut: the language that separates craft from guessing.

By Kevin Tabares · Apr 21, 2026 · 10 min read

Most creators brush past editing terminology like it's jargon. It's not. When your editor says "the retention curve is nosediving at the mid-roll" or "let me duck the ambient sound," they're being precise about what they're fixing. Knowing the language means you can direct them faster, spot problems in your own drafts, and most importantly: avoid paying for work you don't understand.

This glossary covers the terms that actually matter in long-form YouTube editing — the ones that show up in feedback calls, in retention graphs, and in the difference between a video that holds its audience and one that bleeds viewers at predictable moments.

Hooks and retention

Hook: The opening 15 seconds of your video. In long-form YouTube, the hook is the difference between 70% retention and 30%. A hook isn't cute — it's functional. It must make a viewer ask a question they want answered in the next five minutes.

Pattern interrupt: A cut, sound effect, B-roll change, or visual transition that breaks the viewer's attention pattern and resets engagement. When retention graphs show a dip at 4:30, an editor uses pattern interrupts to hold you before the fall.

Retention curve: The line in YouTube Studio that shows what percentage of your audience is watching at each moment. A flat retention curve means your audience is staying. A curve that drops sharply means they're leaving at a predictable moment — usually because something broke the pacing.

Watch time: The total minutes watched across all viewers of a video. YouTube prioritizes videos with high watch time. A video with 40% retention to the end generates far more watch time than a video with 60% retention that people click away from halfway through.

CPM (Cost Per Mille): How much advertisers pay YouTube per 1,000 ad impressions. Higher CPM usually means higher-income viewers. Editorial content often has higher CPM than gaming.

RPM (Revenue Per Mille): How much you make per 1,000 views, after YouTube takes its cut. RPM is lower than CPM. Your actual take-home scales with both view count and RPM.

The retention-RPM connection: Every percentage point of retention you gain is compounding revenue. At a $3 RPM, a 5% retention lift on 200K views worth $600 total is $30 extra per video, recurring on your channel forever.

Editing and cuts

J-cut: When the audio from the next scene starts before the video cuts to it. You see the current scene, hear the next person start talking, and then you cut to their face. Feels smooth. Most TV uses this.

L-cut: The opposite. Video from the next scene appears before the audio cuts to it. You cut to a new visual while the old person is still talking, creating a smooth visual transition while dialogue continues.

Cross-fade: Two shots blend from one to the other over 200-400ms. Not a hard cut. Used for pacing — every cross-fade adds a tiny breath.

Jump cut: You cut from one moment to the next with no transition. The speaker jumps from position A to position B on screen. Can feel jarring on purpose — used for energy or comedic timing.

B-roll: Any footage that isn't your main shot. If you're talking to camera, B-roll is the screen recordings, cutaways, or secondary footage that plays over your words. Editors use B-roll to hide cuts, break pacing, and make a boring moment visually interesting.

Cutaway: Similar to B-roll, but more specific — footage that cuts away from the main speaker. Used to break up static talking-head footage.

Sound and audio

Ducking: Lowering the volume of background audio or music when the primary audio (your voice) plays. The music "ducks" under the speech so you stay clear. Without ducking, your voice competes with background music.

Ambient sound: Background audio that isn't dialogue or music. Room tone, keyboard clicks, air conditioning hum. Most editors remove or minimize it for clean audio.

Foley: Sound effects that are recorded or added in post to match on-screen actions. A keyboard click, a door slam, a pen click — these are foley. They make edits feel physical.

Sound design: The intentional use of audio effects, music, and foley to shape the mood and pacing of a video. Good sound design is invisible but essential — it's why a video feels polished vs amateur.

Technical and platform terms

AVD (Average View Duration): YouTube's metric for how long, on average, viewers watch your video. If your video is 20 minutes and your AVD is 12 minutes, you're holding 60% of viewers on average.

Dropout rate: The inverse of retention. How many viewers are leaving at each moment. A steep dropout curve at 2:00 means you're losing audience at that point — usually because the hook failed.

Click-through rate (CTR): What percentage of impressions result in a click to watch the video. Thumbnail quality usually drives CTR. A 5% CTR is strong; 2% is weak.

Impressions: How many times YouTube showed your thumbnail to a viewer (whether they clicked or not). More impressions = more opportunities for clicks.

Frame rate: How many individual images play per second. 24fps feels cinematic, 30fps is standard for most content, 60fps is smooth for motion or gaming. YouTube renders all of these fine — the choice is creative.

Aspect ratio: The width-to-height ratio of your video. 16:9 is standard YouTube long-form. Shorts and TikTok use 9:16 (portrait). Knowing the ratio matters when you're designing text or B-roll.

Workflow and process terms

Project file: The editor's working file in Premiere Pro or DaVinci Resolve. It contains all the clips, transitions, and effects. Most contracts include delivery of the project file so you own the edits.

Revision round: When you ask for changes and the editor reworks the video. Most contracts include two rounds. Asking for changes after that is either free or costs extra.

Turnaround: The time between when you hand off footage and when the editor delivers the first cut. 24-72 hours is standard for long-form. Same-day turnaround is rare and usually costs more.

Style guide: A document or reference that shows the editor what your visual brand is. Color grading preferences, music choices, transition styles, text fonts. A good style guide prevents revision rounds.

"The difference between a cheap edit and a retention-led edit is how much of the editor's brain lives in the file. Do they only cut? Or do they engineer for watch time?"

Learn the language to direct faster

Knowing these terms doesn't make you an editor. But it makes you a better client of an editor. When you can say "the retention is dropping at 4:30, I need pattern interrupts or B-roll" instead of "it feels boring there," your editor gets specific direction and nails it faster. You also spot problems faster when you're watching your own rough cuts.

Most of the creators we work with become fluent in this language after two or three videos. You start noticing the J-cuts in your own edits, you can read a retention graph with clarity, and you know what to ask for to fix a flat zone.

Related guides

Strategy
What retention-led editing actually means (and when it's marketing fluff)
Retention
The 30-second rule: engineering YouTube hooks that hold retention
Hiring
How to test an editor in one paid trial video