AI Video Hook Formulas: 3-Second Opener Templates for Ads, Tutorials, and Stories
· Genra AIThe first 3 seconds of any AI video decide whether the next 30 get watched. Here are 15 hook formulas — 5 for ads, 5 for tutorials, 5 for stories — with the structure, the AI prompt, and the metric each one is built to move.
If you've spent any time analyzing video performance dashboards in 2026, you already know the brutal arithmetic: roughly half of viewers drop off in the first 3 seconds. On TikTok and Reels the curve is even sharper — closer to 65% gone before second 3 if the opener doesn't catch them. Every minute of effort you put into the rest of your video is multiplied (or zeroed out) by what those first 3 seconds do.
The instinct most creators have, including those using AI video tools, is to put a lot of work into the body of the video and treat the opener as something that "introduces" the content. That's backwards. The opener is the highest-leverage 3 seconds in the entire piece. Get them right and your retention curve flattens. Get them wrong and your bounce rate makes the rest of the work invisible.
This article is a working library: 15 hook formulas, organized by video type. Five each for the three formats AI creators ship most: ads (paid social, performance creative), tutorials (how-to, explainer, educational), and stories (narrative, brand, lifestyle). For each formula you'll get the structure, an AI video prompt that produces it, the kind of subject it works for, and the metric it tends to move.
None of these are theoretical. They're patterns pulled from millions of high-retention videos analyzed across TikTok, Reels, Shorts, and paid social over the last 18 months. The same patterns work whether the video is shot live, made in Premiere, or generated by an AI agent.
Why the First 3 Seconds Are Everything
The 3-second cutoff isn't arbitrary. It's a function of how modern social platforms measure value:
- TikTok uses a "video completion rate" signal that weights early dropoff heavily. A 3-second swipe-away tells the algorithm the content failed.
- Meta (Reels & Stories) measures "ThruPlay" but weights average watch time. Videos that lose viewers in the first 3 seconds are penalized in distribution and in ad cost.
- YouTube Shorts measures average view duration. The first 3 seconds are where the curve makes or breaks.
- Paid creative on every platform shows you a fast retention chart. The cliff at second 3 is the line between a winning ad and a losing one.
For AI video specifically there's an extra layer. Audiences in 2026 have been trained to recognize "AI-looking" content at a glance and scroll past it as a category. Your hook isn't just competing for attention — it's also pre-empting the "this is AI slop" reflex. A strong hook earns you 7 more seconds before the audience even starts evaluating quality.
The Three Things a Hook Has to Do
- Stop the scroll. Visual or audio pattern break that interrupts auto-scroll behavior.
- Promise a payoff. Make it clear that watching the next 20–30 seconds will be worth it.
- Suit the format. Ad hooks promise a benefit. Tutorial hooks promise a learning. Story hooks promise an emotional arc.
Every formula below is a different way of doing all three.
5 Hook Formulas for Ads
Ad hooks have to do something the other two formats don't: they have to surface the product or the problem within 3 seconds. You don't have time to build a story arc. You have time to either name a pain point so accurately that the audience feels seen, or to make a promise so concrete that they want to know how it gets delivered.
A1. The Specific Pain Hook
Structure: name a hyper-specific frustration the target audience has had within the last 24 hours.
Pattern: "If you've ever [specific tiny pain]…"
Example: "If you've ever spent 40 minutes choosing background music for a 30-second video, this is for you."
AI prompt: "Close-up of a frustrated creator scrolling through audio library, deflated body language, dim ambient lighting, voiceover names the specific pain in the first 2 seconds."
Best for: productivity tools, SaaS, agencies. Moves: CTR, install rate.
A2. The Big Specific Promise Hook
Structure: a concrete outcome with a timeframe, ideally with a number.
Pattern: "[Specific result] in [specific time], without [common requirement]."
Example: "Make a finished 30-second product video in under 4 minutes. No editing. No script writing."
AI prompt: "Split-screen: left side shows a stopwatch counting from 4:00 to 0:00, right side shows a finished product video assembling shot-by-shot in real time. Bright, high-energy color grade."
Best for: consumer apps, e-commerce, any product with a quantifiable outcome. Moves: CTR, conversion rate.
A3. The Pattern Interrupt Hook
Structure: a visual or verbal pattern break that contradicts what the viewer expects to see.
Pattern: open with the most surprising or counter-intuitive moment from the entire video.
Example: open a vacuum cleaner ad with a wide shot of someone deliberately spilling coffee on a white rug.
AI prompt: "Wide static shot, white living room rug, coffee cup tips over in slow motion in the first second, dark stain spreads. No voiceover, ambient room sound only."
Best for: CPG, home goods, demos. Moves: thumb-stop rate, watch time.
A4. The Negation Hook
Structure: tell the viewer NOT to do something everyone else is doing. Negation creates curiosity faster than affirmation.
Pattern: "Don't [common behavior] until you've seen this."
Example: "Don't write your next AI video prompt until you've watched 20 seconds of this."
AI prompt: "Direct-to-camera medium shot, creator holding up a hand in 'stop' gesture, sharp eye contact, slight lean toward camera, neutral background, hard front lighting."
Best for: education, tools, anything where the alternative is bad. Moves: watch time, share rate.
A5. The Social Proof Hook
Structure: open with a specific, credible result someone else got. Numbers and names beat adjectives.
Pattern: "[Specific person/title] used [thing] to [specific result]."
Example: "Three Shopify store owners are getting 8x ROAS on AI-made ads. Here's exactly what they're doing differently."
AI prompt: "Triptych composition: three split panels, each showing a different store owner working at their setup, soft natural light, on-screen text overlays the ROAS number for each one."
Best for: B2B, agencies, courses, anything where credibility matters. Moves: lead-form completion, conversion.
5 Hook Formulas for Tutorials
Tutorial hooks have a different job. The viewer is there to learn something. You don't need to convince them to want a payoff — you need to prove fast that the payoff exists, that you know how to deliver it, and that watching is worth the time investment over the dozen other tutorials they could have clicked on.
T1. The Result-First Hook
Structure: show the finished outcome before you explain anything.
Pattern: "Here's what we're building. Now let me show you how."
Example: a 2-second flash of the finished video, then "in the next 90 seconds I'll show you exactly how to make this."
AI prompt: "Open with a 2-second high-quality preview of the finished output, hard cut to creator on a clean studio set, direct address to camera, holding the same finished output on a tablet."
Best for: design, video, code, recipes — anything visual. Moves: average view duration, like rate.
T2. The Mistake / Misconception Hook
Structure: name a mistake your audience is making, ideally one they don't realize they're making.
Pattern: "Most people [common approach]. Here's why that's making [outcome] worse."
Example: "Most creators write AI video prompts in one long paragraph. That's why their generations look generic. Here's the fix."
AI prompt: "Side-by-side comparison: left shows a long paragraph prompt with messy text, right shows a clean structured prompt format. Slight zoom-in on the right side."
Best for: education, expert content, anything where the audience already has habits. Moves: watch time, save rate.
T3. The Authority Hook
Structure: establish credibility in the opening line, then promise a specific learning.
Pattern: "I've [specific credential / experience]. Here's the [framework / system / mistake] I wish I'd known."
Example: "I've shipped over 200 AI-generated ad creatives. Here are the 3 hook patterns that consistently outperform the rest."
AI prompt: "Medium close-up direct address, creator in well-lit home office, books or work artifacts visible in background, calm authoritative delivery."
Best for: expertise content, B2B, professional services. Moves: lead generation, follower conversion.
T4. The Shortcut Hook
Structure: promise the fastest path to the result.
Pattern: "The fastest way to [outcome]. No [common slow method] required."
Example: "The fastest way to make a product video that converts. No camera, no actors, no editing software."
AI prompt: "Time-lapse style assembly of a finished video, on-screen counter ticks down from a long time to a short time, vibrant high-energy edit."
Best for: tools, SaaS, productivity. Moves: install rate, signup conversion.
T5. The Question-Reframe Hook
Structure: open with the exact question your viewer is silently asking, then signal you have a non-obvious answer.
Pattern: "Why do [X people] keep [problem]? It's not [obvious answer]."
Example: "Why do most AI videos still look fake? It's not the model. It's something almost no one is paying attention to."
AI prompt: "Direct-to-camera, creator pauses for a beat after asking the question, soft side lighting, intimate medium close-up."
Best for: thought leadership, deep dives, opinion content. Moves: comment rate, share rate.
5 Hook Formulas for Stories
Story hooks are a different beast. The audience isn't there for a benefit or a learning — they're there for an emotional arc. The hook has to drop them into the middle of one fast enough that they want to know how it ends.
S1. In Medias Res (Mid-Action Open)
Structure: start in the middle of the most charged moment of the story, then flash back.
Pattern: Open on the climax. Cut to "3 hours earlier" / "let me back up."
Example: open on a runner crossing a finish line in tears, then cut to "I'd just spent 6 months unable to walk."
AI prompt: "Slow-motion close-up of the climactic moment, intense expression, ambient sound only. Hard cut to a flat handheld shot of an earlier mundane scene with on-screen text 'six months earlier.'"
Best for: brand films, transformation content, documentary-style. Moves: completion rate, share rate.
S2. The First-Person POV Hook
Structure: drop the viewer into a first-person perspective immediately, with stakes.
Pattern: "I have [time pressure / stakes]. [Specific action.]"
Example: "I had 24 hours to figure out if my product idea was real. So I generated 50 ad variants overnight."
AI prompt: "First-person POV shot, hands typing rapidly at a laptop, dim room with screen light on the face, voiceover starts mid-sentence as if mid-thought."
Best for: founder content, lifestyle, personal stories. Moves: average view duration, comment engagement.
S3. The Visual Anomaly Hook
Structure: open with a single image that doesn't make sense yet. The viewer stays to find out why.
Pattern: show an unexpected scene with no explanation. Let curiosity hold them.
Example: open on a fully suited businessman pushing a shopping cart full of basketballs through a parking lot.
AI prompt: "Wide static shot, businessman in full suit pushing a shopping cart full of basketballs through an empty parking lot at golden hour. No voiceover, no on-screen text, ambient sound only. 4 seconds."
Best for: brand storytelling, viral content, surreal/comedic. Moves: thumb-stop, share rate, watch time.
S4. The Direct Stakes Statement
Structure: state the stakes of the story in one sentence before any context.
Pattern: "[Time / situation] when I [made the decision]."
Example: "It was 11 PM when I decided to delete the entire app and start over."
AI prompt: "Medium shot, dimly lit, the speaker mid-action — hand hovering over a 'delete' button on a laptop, cool color temperature, voiceover begins as the action freezes."
Best for: founder content, behind-the-scenes, decision-driven narratives. Moves: completion rate, follower conversion.
S5. The Confession Hook
Structure: open with an admission the audience doesn't expect from someone in your position.
Pattern: "I [unexpected admission for someone like me]."
Example: "I run a video agency and I haven't picked up a camera in 11 months."
AI prompt: "Tight medium close-up, soft window light, eye contact direct to camera, slight smile, casual delivery as if mid-conversation, neutral background."
Best for: personal brand, expert content, contrarian takes. Moves: share rate, follow rate.
AI Video Hooks Have Two Extra Rules
Every formula above works for any video. AI-generated video adds two constraints worth knowing.
1. Visuals Have to Carry the Hook Before the Voiceover Lands
By 2026, the majority of social video gets watched on muted autoplay. Your visual hook has to land in frame 1, not in word 1 of the voiceover. This means: the on-screen action, composition, and color have to telegraph the hook even if the audio never plays. Test it muted. If you don't get it muted, the hook is failing.
2. The First Shot Has to Pre-empt "AI Slop" Pattern Recognition
Audiences in 2026 are pattern-trained to spot AI video instantly: too-smooth motion, plastic skin, generic compositions, soft default lighting. If your first frame matches that pattern, viewers scroll before your hook can land. Counters: open with a specific real-feeling detail (a hand fidgeting, hair moving in wind, a believable shadow), use a documented cinematography choice (named lens, specific lighting source), or open with a static shot that emphasizes texture rather than motion. The goal isn't to hide that the video is AI — it's to make the first frame look like deliberate craft, not auto-generation.
Both rules apply to every hook formula above. Treat them as filters, not as separate techniques.
Picking the Right Hook for Your Video
Three rules of thumb that hold up in production:
- Match hook to format. Don't use a story hook for an ad — you'll waste your CTR. Don't use an authority hook on a comedy story video. The hooks below are sorted by format for a reason.
- Match hook to audience temperature. Cold audiences (paid traffic, FYP) need bigger contrast in the hook — pattern interrupts, big promises, visual anomalies. Warm audiences (subscribers, retargeting) tolerate slower, more curiosity-driven hooks (questions, confessions).
- Test hooks, not full creatives. The single highest-leverage A/B test in performance video is replacing the first 3 seconds and leaving the rest of the video untouched. You'll see CTR or retention move 30–60% before any other variable.
The complete library:
| Code | Hook | Format | Best Use Case | Metric Moved |
|---|---|---|---|---|
| A1 | Specific Pain | Ad | SaaS, agencies | CTR |
| A2 | Big Specific Promise | Ad | Apps, e-commerce | Conversion |
| A3 | Pattern Interrupt | Ad | CPG, demos | Thumb-stop |
| A4 | Negation | Ad | Education, tools | Watch time |
| A5 | Social Proof | Ad | B2B, courses | Lead form |
| T1 | Result-First | Tutorial | Visual fields (design, video, code) | View duration |
| T2 | Mistake/Misconception | Tutorial | Expert content | Save rate |
| T3 | Authority | Tutorial | B2B, professional | Lead gen |
| T4 | Shortcut | Tutorial | Tools, SaaS | Signup |
| T5 | Question-Reframe | Tutorial | Thought leadership | Comment rate |
| S1 | In Medias Res | Story | Brand films | Completion |
| S2 | First-Person POV | Story | Founder content | View duration |
| S3 | Visual Anomaly | Story | Brand, viral | Share rate |
| S4 | Direct Stakes | Story | Decision narratives | Completion |
| S5 | Confession | Story | Personal brand | Follow rate |
How Genra Handles This
Picking the right hook is a real skill. Even with this library in hand, applying it correctly to a specific brief — getting the format right, the audience temperature right, the AI-slop counter right, and the visual / audio carry right — takes practice.
Genra's agent is built to do this end-to-end. When you describe what video you want, Genra picks the format (ad, tutorial, story), selects a hook formula appropriate to your audience and goal, and writes the opener so the visual and the voiceover both carry it. You don't have to know the difference between an A2 and a T1 — the agent does.
If you'd rather learn the formulas and apply them yourself, this article is your reference. If you'd rather skip to finished videos that already use them, try Genra free. 40 credits, no card.
Key Takeaways
- Roughly half of viewers leave in the first 3 seconds. The opener is the highest-leverage 3 seconds in your video.
- A working hook has to do three things: stop the scroll, promise a payoff, and suit the format.
- Ad hooks surface a pain or a benefit fast: Specific Pain, Big Specific Promise, Pattern Interrupt, Negation, Social Proof.
- Tutorial hooks prove fast that the payoff exists and the payoff is worth your time: Result-First, Mistake/Misconception, Authority, Shortcut, Question-Reframe.
- Story hooks drop the audience into an emotional arc: In Medias Res, First-Person POV, Visual Anomaly, Direct Stakes, Confession.
- AI video hooks have two extra rules: the visual has to land before the voiceover (assume mute), and the first frame has to look like craft, not auto-generation.
- The single highest-leverage A/B test in performance video is replacing the first 3 seconds. Expect 30–60% movement on CTR or retention.
- Match hook to format, audience temperature, and (crucially) test hooks instead of full creatives.
Frequently Asked Questions
Why exactly 3 seconds, not 5 or 10?
Three seconds is the cutoff platform algorithms care about most. TikTok, Reels, and Shorts all weight the very early dropoff signal heavily. Statistically about half of all viewers who will eventually leave are gone by second 3. The 5-10 second window matters too, but it's a secondary battle — if you lose the audience by second 3, the rest doesn't get to play.
Can I use the same hook formula for multiple videos in a campaign?
You can, but you shouldn't. The same hook applied to 10 different videos in the same feed becomes a pattern your audience starts to recognize and skip. Rotate at least 3 hook formulas across a campaign. The performance lift compounds.
Should I write the hook before or after the body of the video?
Both, in two passes. Draft a hook before you write the body so you know what you're delivering against. Then rewrite the hook after the body is done — by then you know the strongest specific moment in the body, and the strongest hook is often a 2-second flash of that moment plus a one-line tease.
Do I really need a different hook for ads vs. tutorials vs. stories?
Yes. The audience's mental contract is different for each. Ad viewers are scanning for "is this for me." Tutorial viewers are scanning for "will this teach me something." Story viewers are scanning for "is this going somewhere." A hook that violates the format's contract loses the audience even when the rest of the video is good.
How long should the rest of the video be after a strong hook?
It depends on the platform and format. On TikTok and Reels, 25–35 seconds is the sweet spot for retention-weighted reach. On Shorts, 35–45 seconds. For paid social ads, 12–18 seconds for prospecting and 30–45 for retargeting. Don't extend past the natural arc just to fill time — the audience will leave the moment payoff stops.
Can AI video models actually produce these hook visuals well?
Yes, with one caveat: hooks that depend on highly specific human emotional expression (subtle confession smile, layered grief, comedic timing) are still where AI video models are weakest. For those, either use a generated establishing shot plus stock or filmed close-up for the emotional beat, or pick a hook formula that depends more on composition and action than on facial nuance.
What's the most reliable hook for a brand-new product nobody knows yet?
For unknown products, A2 (Big Specific Promise) and A3 (Pattern Interrupt) consistently outperform. They don't require the audience to recognize the brand. They require the audience to recognize a pain or be stopped by a visual. Save the social-proof hooks (A5) for after the product has results to cite.
How do I test which hook is winning without spending a fortune?
Run 3–5 hook variants over the same 20-second body, push small budget ($20–50/variant) on Meta or TikTok, and look at thumb-stop rate at second 3 and watch time at second 7. The differences usually surface within 24 hours and within $200 of total spend. Then push budget into the winner.
About the Author
The Genra AI team builds tools that help creators produce professional video content using AI. Follow @GenraAI for updates, tutorials, and honest takes on the AI video space.