Exercises: The Scroll-Stop Moment

These exercises progress from concept checks to challenging applications. Estimated completion time: 3 hours.

Difficulty Guide: - ⭐ Foundational (5-10 min each) - ⭐⭐ Intermediate (10-20 min each) - ⭐⭐⭐ Challenging (20-40 min each) - ⭐⭐⭐⭐ Advanced/Research (40+ min each)


Part A: Conceptual Understanding ⭐

A.1. Explain the difference between pre-attentive processing and conscious evaluation. At what point in the scroll-stop timeline does each occur?

A.2. List four visual features that the brain processes pre-attentively. For each one, give a specific example of how it could be used in a video's first frame.

A.3. Why do pattern interrupts eventually stop working? Give an example of a scroll-stop technique that was once effective but has become so common it no longer interrupts the pattern.

A.4. Explain the difference between a scroll-stop (earning a pause) and clickbait (stealing a pause). How can a viewer tell the difference?

A.5. What is the "thumbnail promise"? Why is it particularly important on YouTube compared to TikTok?

A.6. Describe the S.T.O.P. framework. What does each letter stand for, and why is each element important?

A.7. The chapter states that audio hooks are part of the scroll-stop moment. On what platforms is this most true, and on what platforms might it matter less? Why?


Part B: Applied Analysis ⭐⭐

B.1. The Squint Test in Practice Take screenshots of the first frames of five different videos from your feed. For each one: - Apply the squint test (squint until the image blurs) - Identify the most salient element - Score each on the salience hierarchy from Section 3.3 - Rank the five from most to least likely to stop your scroll, and explain your ranking

B.2. S.T.O.P. Framework Audit Choose three videos from your favorite creator. Score each one's opening using the S.T.O.P. framework (Salience, Tension, Ownership, Promise — each scored 1-5). Which video scores highest? Does the S.T.O.P. score correlate with the video's performance (views, engagement)?

B.3. Audio Hook Analysis Open TikTok or Instagram Reels and listen to the first 2 seconds of 10 consecutive videos (don't watch — just listen). Categorize each audio hook using the types from Section 3.5 (cold open, bold claim, sound effect, vocal contrast, question, or other). Which type appears most frequently? Which type grabs YOUR attention most effectively?

B.4. The Title-Thumbnail Contract Find three YouTube thumbnails and titles that create a strong "contract" with the viewer (clear promise, narrative tension) and three that create a weak contract (vague, generic). Analyze what makes the strong ones work using the framework from Section 3.4.

B.5. Pattern Interrupt Lifecycle Identify a scroll-stop technique that you've seen become popular recently. Document its lifecycle: - When did you first notice it? - How many creators have adopted it? - Is it still effective as a pattern interrupt, or has it become the new pattern? - What could replace it while maintaining the same psychological principle?


Part C: Real-World Application Challenges ⭐⭐-⭐⭐⭐

C.1. The 10-Hook Challenge ⭐⭐ Choose one topic you care about (anything: a hobby, a school subject, an opinion, a skill). Write 10 different scroll-stop openings for a video about that topic — using 10 different techniques from the 50 in Section 3.6. For each: - Describe the first frame (visual) - Write the first 2 seconds of audio - Identify which technique you're using - Score it on the S.T.O.P. framework

C.2. The Scroll-Stop A/B Test ⭐⭐⭐ Create two versions of the same video with different openings — one designed using the S.T.O.P. framework (target score 16+) and one using your natural instinct (no framework). Post both (at different times or on different platforms) and compare: - Views after 24 hours - Average watch time - Scroll-stop rate (impression-to-view ratio, if your platform shows this) Write a brief analysis of which performed better and why.

C.3. The Feed Context Experiment ⭐⭐⭐ Design three different first frames for the same video, each optimized for a different feed context: - Frame A: For a feed full of talking-head videos (needs to contrast with faces) - Frame B: For a feed full of text-overlay memes (needs to contrast with text) - Frame C: For a feed mixed with high-production content (needs to be visually striking) Explain how feed context changes your scroll-stop strategy.

C.4. The Scroll-Stop Redesign Challenge ⭐⭐⭐ Take your three lowest-performing videos (or choose three from a small creator). For each: 1. Score the original opening using S.T.O.P. 2. Identify the weakest element(s) 3. Design a new opening that scores 16+ 4. Explain the specific psychological principles driving each improvement


Part D: Synthesis & Critical Thinking ⭐⭐⭐

D.1. The chapter presents 50 scroll-stop techniques. But could optimizing for scroll-stops have diminishing returns? If every creator uses these techniques, does the overall quality of the feed improve (more interesting content) or deteriorate (more sensationalism)? Make an argument for both sides.

D.2. The chapter mentions that "the scroll-stop is a promise." But many successful videos have scroll-stops that are loosely connected to the actual content (a shocked face, dramatic text) without being outright clickbait. Where exactly is the line between a compelling hook and a misleading one? Can you define a clear boundary, or is it inherently fuzzy?

D.3. Compare the scroll-stop challenge to attention-getting in other contexts: a book cover in a bookstore, a headline in a newspaper, a movie trailer, a food truck's menu design. What principles are universal across all these "first impression" contexts, and what's unique to the social media scroll-stop?

D.4. The S.T.O.P. framework scores "Ownership" — making the viewer feel the content is for them specifically. But hyper-targeting ("If you're a 16-year-old who plays guitar and has a dog...") limits reach. How do you balance specificity (high ownership) with broad appeal? Is there a way to make content feel personal without being exclusive?


Part E: Research & Extension ⭐⭐⭐⭐

E.1. Research the concept of "saliency maps" in computer vision and neuroscience. Find an online saliency map generator (several free ones exist) and run your own video thumbnails or first frames through it. Compare the predicted gaze pattern to where you intended the viewer to look. Write up your findings.

E.2. The chapter discusses pre-attentive processing as happening within 40-80ms. Find the original vision research supporting this timeline. How was it measured? What specific experimental paradigms were used? How confident are scientists in these timing estimates?


Solutions

Selected solutions available in appendices/answers-to-selected.md