42 min read

There is a moment in every game designer's career when they write their first cutscene. They have something important to show — a villain's monologue, a betrayal, a death — and they reach for the most familiar tool in storytelling: a scene where the...

Chapter 23: Cutscenes, Cinematics, and When to Take Control Away from the Player

There is a moment in every game designer's career when they write their first cutscene. They have something important to show — a villain's monologue, a betrayal, a death — and they reach for the most familiar tool in storytelling: a scene where the camera does the work and the audience watches. They build it. They test it. It looks great. They hand the build to a friend, who plays through twenty minutes of tutorial, defeats the first boss, and then — at the triumphant climax of the opening act — puts the controller down as the cutscene plays.

The designer watches this happen. They watch their friend's eyes drift to a phone. They watch the friend check a text, return to the screen during a pause in the music, shrug, and ask, "How long is this?"

This is the original sin of cutscene design: confusing the medium you love (games) with the medium you grew up on (films). It is a mistake every designer makes at least once. The craft of cutscenes begins with understanding why they are dangerous — and then, only then, learning to use them well.

Cutscenes break the contract. The contract you have with the player from the moment they press Start is: you are in control; your choices matter; you are the one doing this. The cutscene revokes that contract, temporarily. It says: for the next N seconds, watch. Every cutscene is a small betrayal of what the player came for. When the betrayal is worth it — when you show them something they could not otherwise see, land an emotional beat they could not otherwise feel — the trade is fair. When the betrayal is not worth it, the player remembers. They remember putting the controller down. They remember the moment the game stopped being a game.

This chapter is the practitioner's guide to when and how to take control away. We will look at the long arc of cutscenes as a technique — from pre-rendered movie-interludes to in-engine seamless takes to interactive cutscenes that try to split the difference. We will cover the Metal Gear Solid problem (when your game accidentally becomes a movie) and its opposite (God of War 2018's one-shot camera, which refused to cut at all). We will look at quick-time events, which are cutscenes pretending to be interactive, and which fail more often than they succeed. And we will implement a small, functional cutscene player in Godot — enough to insert a brief moment of authored cinema into your prototype at the right time.

By the end, you will have a working framework for deciding whether any given moment deserves a cutscene, how long it should be, whether it should be skippable, and how to transition in and out of it without jarring the player. Most of all, you will understand the single most important rule of cutscene design: when in doubt, make them shorter.

The Eternal Debate

Nobody agrees on cutscenes. At any game-dev conference, you can find two panels in adjacent rooms making opposite arguments.

On one side: cutscenes are cinema's gift to games. They let you tell stories the gameplay cannot. They deliver the emotional payoffs that gameplay alone cannot stage — the deaths, the reunions, the revelations. Without cutscenes, games could not have given us the ending of The Last of Us or the opening of Red Dead Redemption 2 or the quiet moments between Atreus and Kratos. They are how games reached the narrative heights of Final Fantasy VII and Metal Gear Solid and every beloved JRPG. Take cutscenes away, and games regress to the arcade era.

On the other side: cutscenes are a crutch. They are what designers reach for when they cannot think of a gameplay solution. They violate the interactivity that defines the medium. The greatest moments in games are not in cutscenes — they are in gameplay, when the player does something meaningful. Portal's ending triumph happens in gameplay. Shadow of the Colossus's grief happens in gameplay. Journey's transcendence happens in gameplay. Cutscenes are the parts of games that feel most dated twenty years later, because they show the game failing to be a game.

Both arguments are right. They are right about different things. The cutscene-defenders are right that authored cinema can land beats gameplay cannot. The cutscene-skeptics are right that gameplay is where the medium's specific power lives. What this means in practice: you use cutscenes surgically, not generally. You use them for specific payoffs. You keep them short. You make them skippable. You do not use them as an information-delivery mechanism or a reward for play, because these are jobs gameplay does better.

💡 Intuition: Treat cutscenes like injections, not like meals. An injection is brief, targeted, and delivered because nothing else will do the job. A meal is the main event. Games fail when they treat cutscenes as meals — when the cinematic is the main course and gameplay is what you eat between. The meal in a game is the play. The cutscene is the shot in the arm at the moment a shot is needed.

When to Use a Cutscene

Four legitimate reasons for a cutscene exist. Memorize them.

Narrative payoff. A scene that lands an emotional beat the player has been building toward. A death, a reunion, a revelation, a twist. These beats need authored framing because they require specific camera angles, specific timing, specific pacing. The ending of The Last of Us (the first one) works because the final scene is authored. Joel's lie to Ellie, Ellie's choice to believe him — these beats could not happen in free-form gameplay. They are cinema because cinema is the right tool.

Character moment. A scene that reveals something about a character that gameplay cannot reveal. Arthur Morgan coughing blood in Red Dead Redemption 2. Solid Snake's conversation with EVA at the end of MGS3. A boss's villainous monologue. A companion's confession. These are moments where characters speak and react and look at each other in ways that require the director's eye. Gameplay can get close to this, but cinema is stronger.

Showing the impossible. A scene that depicts events the gameplay engine cannot support — a massive army battle seen from a hilltop, a city being destroyed, a god descending from the sky, a character doing something too scripted or complex to leave to player input. Sometimes the spectacle exceeds the engine's interactivity budget. A cutscene lets you show it anyway.

Transition and pacing. A scene that bridges two gameplay sequences — the player defeats the dungeon boss, a cutscene carries them to the next region, the gameplay resumes. Used sparingly, these transitions help pacing. Used heavily, they start to feel like the game is on rails.

Notice what is not on this list. Explaining the controls. Introducing the world's lore. Rewarding the player. Establishing character backstory. These are common reasons for cutscenes that are almost always wrong. We turn to them next.

When NOT to Use a Cutscene

The single most common failure of cutscene design is using cutscenes for jobs that gameplay does better.

Tutorials

Never teach through cutscene. Never.

This rule admits exactly one exception, which we will get to, but the baseline is: when you need to teach the player a mechanic, you teach them through play. The player does the thing; the game lets them do it; they learn it in their muscles. A cutscene that shows them a mechanic teaches nothing — they watched a thing, they cannot replicate it, they will forget it in ninety seconds.

Portal teaches the entire portal-gun mechanic through graduated gameplay challenges, with no explanatory cutscene. By the time you are solving the complex puzzles, you have learned through play. Super Mario Bros. World 1-1 teaches Mario's entire movement vocabulary without a single word. Journey teaches its traversal mechanics through environmental affordances. The pattern is: play first, understand later.

The exception is when the mechanic is too complex or too rare to teach through play at the pace the game needs. Some fighting games use cutscenes to introduce a new character's moveset by showing the moves in action, because the player has seconds — not the minutes a proper gameplay tutorial would take — to absorb the information. Even here, the better pattern is a "practice room" adjacent to the tutorial cutscene, so the cutscene is the trailer and the practice is the school.

⚠️ Common Pitfall: The "tutorial-as-cutscene" mistake usually appears in mid-development, when the team realizes the opening is confusing and reaches for the nearest tool — a cutscene — to explain what the player should do. This always feels worse than a gameplay tutorial that was designed from the start. If your opening needs a cutscene to explain itself, the opening needs to be redesigned, not wrapped.

Information Dumps

If your cutscene's job is "tell the player about the world," you have misused the tool.

Games have better mechanisms for world-building. Environmental storytelling (Chapter 22) puts the world's backstory in the world itself — corpses, documents, ruins, graffiti. Dialogue with NPCs (Chapter 21) delivers lore at a pace the player controls. Codex entries let the curious player read deeply while letting the uninterested player play. Collectible audio logs — a BioShock staple — deliver plot-relevant exposition while the player explores.

Cutscenes are the least efficient exposition tool because they are the least skippable (if unskippable) and the least player-controlled. A player forced to sit through three minutes of lore-dump will retain maybe twenty percent of what was said. A player who reads the same lore on a wall at their own pace will retain more, feel less resentful, and absorb it into their mental model of the world.

The canonical failure is the "opening narrator cutscene," in which a voiced narrator tells the player about the kingdom, the war, the ancient prophecy, before the player has any stake in any of it. This was the standard opening for 1990s JRPGs and is almost always a mistake. The player has no reason to care yet. They will not care until after the gameplay has invested them in the world. Save the lore. Earn the player's interest first. Then, if you must, deliver the lore — but deliver it through a mechanism the player can engage with at their pace.

Rewarding the Player by Making Them Watch

This is the subtlest trap. You have designed a difficult challenge. The player finally defeats it. To reward their effort, you reward them with — a five-minute cutscene in which their character triumphs.

The player sits. They are not rewarded. They are being made to watch what they just did.

The reward for defeating a boss should be more gameplay, not the cessation of gameplay. New abilities. New areas. A better weapon. A shorter cooldown. A boss-defeat cutscene can work as a grace note — twenty seconds of ceremony to mark the transition — but if it exceeds thirty seconds, you have turned the player's reward into a punishment.

Dark Souls understands this. Boss deaths are celebrated with a few seconds of fade-to-black and a souls reward. No monologue, no triumph cutscene. The player's reward is that the door is open and more game lies beyond.

🎯 Tradeoff Spotlight: Cutscenes-as-reward is appealing because it is cheap spectacle. You can deliver a big moment without designing mechanical reward. But the spectacle-reward treats the player as an audience member rather than a player. Hold this line: rewards for play are more play, not the suspension of play.

In-Engine vs. Pre-Rendered

A cutscene is either in-engine (rendered by the game's normal rendering pipeline, using the same assets and characters the gameplay uses) or pre-rendered (a video file the game plays back, produced offline with any rendering technology the team wants to use).

Each has advantages. Each has a long history of famous examples.

Pre-Rendered

Pre-rendered cutscenes dominated the PlayStation 1 era. Final Fantasy VII (1997) is the canonical example: the game uses low-poly characters on pre-rendered backgrounds, but major story beats switch to full-screen pre-rendered CGI with photorealistic-for-the-time detail. The opening train sequence, the Jenova encounters, Aerith's death — these were delivered as video files because the in-game engine could not render them.

The advantage is visual fidelity. A pre-rendered cutscene can use film-quality rendering, particle effects, lighting, and animation that the real-time engine cannot approach. On the PS1, this was the difference between low-poly blocks and photographic cinema. Even today, on hardware an order of magnitude more powerful, teams occasionally use pre-rendered for moments where they want visual quality beyond what the engine can produce at runtime.

The disadvantage is disconnect. Pre-rendered cutscenes always look different from the gameplay. The character you were just controlling — textured, lit, moving in a certain way — suddenly becomes a different-looking character, rendered in a different style. The illusion of "this is one world" fractures at every cutscene.

Metal Gear Solid (1998) leans into this fracture by making its cutscenes stylistically distinct — codec-call cinematics, dramatic angle changes, movie-grammar — because the team wanted its cutscenes to feel like cinema. This worked because Kojima's entire design philosophy embraces the cut between game and film. For most games, the disconnect is a bug rather than a feature.

In-Engine

In-engine cutscenes are rendered by the same engine that renders the gameplay. Your character model is the same model. The lighting is the same lighting. The camera work, the animation, the effects — all produced within the game's runtime.

Naughty Dog's games (the Uncharted and Last of Us series) are the contemporary gold standard. Every cutscene is in-engine. When Joel speaks to Ellie in a cutscene, that is the same Joel and Ellie model you just controlled in gameplay. The transition from play to cutscene is seamless — camera pulls into position, character enters scripted animation, scene plays out, camera releases.

CD Projekt Red's Witcher 3 and Cyberpunk 2077 follow the same approach. Every conversation is in-engine. The continuity between play and scene is total.

✅ Best Practice: For a new project, default to in-engine cutscenes. The continuity benefits almost always outweigh the fidelity benefits of pre-rendered. Only reach for pre-rendered when you have a specific visual that the engine cannot produce — a massive army battle, a stylized dream sequence, a moment where the gameplay rendering itself would break the effect.

The advantage is continuity. The world is one world. The character is one character. Dynamic state carries through — if the player's armor is scarred, the cutscene's armor is scarred. If the player customized the character's appearance, the cutscene uses the customized appearance. The fiction holds.

The disadvantage is that the engine must render everything you want to show. If the cutscene needs ten characters, the engine renders ten characters. If it needs a massive explosion, the engine renders that explosion at runtime. This constrains what you can depict to what the engine can handle, and it pushes engineers to optimize for the worst case rather than the average.

The Hybrid

Some games mix the two. Blizzard's games traditionally used pre-rendered for major cinematic beats (the Warcraft III opening, the StarCraft II cutscenes) while using in-engine for smaller transitions and conversations. Final Fantasy XVI uses in-engine for most cutscenes and pre-rendered for the opening credits and a few signature spectacle moments.

The hybrid approach is sensible if your team has the capacity to produce both. Small teams should pick one and commit — usually in-engine, because pre-rendered production is expensive and slow.

A Brief History of Cutscenes

Before we go deeper, it helps to understand how the cutscene became a cornerstone of game design. The earliest cutscenes were not cinema at all — they were text interludes, blocks of descriptive paragraphs between levels in games like Pac-Man (1980, with its intermission cartoons) and the point-and-click adventures of the 1980s. These were tolerated rather than celebrated. Screen-space was limited, and memory was more limited still. A cutscene was two frames of pixel art and a sentence or two.

The PlayStation 1 era changed everything. Optical media — the CD-ROM — gave designers hundreds of megabytes to work with. Suddenly a game could ship with pre-rendered FMV (full-motion video) cutscenes produced in rendering software that was a decade ahead of what the real-time engine could do. Final Fantasy VII (1997) used this capacity to extraordinary effect, and for a generation of players, those pre-rendered cutscenes were the dramatic payoff of the game. The in-game graphics were blocky; the cutscenes were gorgeous. The cutscenes were the prize.

The Xbox and PlayStation 2 generation shifted the balance. By 2005, real-time rendering had caught up to what most pre-rendered cinema showed. Games like Resident Evil 4 could render their cutscenes in-engine at quality indistinguishable from their gameplay. The fidelity gap closed. Now designers had a choice: render ahead of time for absolute quality, or render in-engine for continuity. The debate has continued since.

The seventh and eighth console generations — PS3/Xbox 360 through PS4/Xbox One — saw the rise of the cinematic game as a dominant genre. Uncharted, The Last of Us, God of War, Red Dead Redemption, the Mass Effect and Dragon Age series, Heavy Rain, Detroit: Become Human, the Witcher series, the Final Fantasy series, Metal Gear Solid V. All made cutscenes central. The craft of in-engine cinematography matured into a discipline with its own best practices.

The current generation — PS5/Xbox Series X — continues the trajectory. Fidelity is no longer a constraint; disc-space is generous enough that cutscenes need not be compressed. The remaining design questions are structural: how long, how skippable, how integrated with gameplay, how authored versus reactive. These are the questions you, as a contemporary designer, must answer.

If you want a single panoramic tour of this history, play Final Fantasy VII Remake (2020) and Final Fantasy VII (1997) back-to-back. The same story, built twenty-three years apart, shows how cutscene technology evolved: from stylized pre-rendered flourishes to fully in-engine real-time scenes with interactive elements. The evolution is visible in every scene.

Interactive Cutscenes and Quick-Time Events

Quick-time events (QTEs) are the most controversial tool in this chapter. A QTE is a cutscene that occasionally asks the player to press a button. The button usually appears on screen for a brief window; pressing it correctly continues the cutscene; failing it costs health or triggers a different branch.

QTEs exist because designers, aware that cutscenes break the play contract, wanted to give the player something to do during them. The intuition is sound. The execution is mostly bad.

When QTEs Work

QTEs work when the pressed button corresponds to a meaningful action the player would want to take. They work when the timing window is generous enough not to punish inattention. They work when the consequence of failure is interesting rather than just "cutscene restarts."

The God of War series (both pre-reboot and 2018+) uses QTEs for signature combat finishers. Kratos grapples a cyclops, the player hits the buttons, Kratos delivers a brutal execution. These work because the buttons feel connected to the action — each press corresponds to a phase of the fight. They also work because the consequences matter: a failed QTE costs the player the execution and may cost them the fight.

Shadow of the Colossus (2005) uses a related technique: during colossus climbs, the player holds a grip button to avoid being shaken off. This is not strictly a QTE (the input is continuous rather than a button press) but it serves a similar function — keeping the player active during a scripted sequence. It works because the grip is a mechanic integrated with the rest of the climbing system, not a bolted-on prompt.

Heavy Rain (2010) built its entire gameplay around contextual QTEs, and for players who appreciated the style, it created a unique cinematic experience. The QTEs represent the characters' physical actions — tying a tie, negotiating a tight corner, lifting a heavy object — and the branching consequences mean that failing a QTE doesn't restart the scene but changes its outcome. This is a QTE done as a coherent gameplay loop, not an interruption.

When QTEs Don't Work

Most of them.

Resident Evil 5 (2009) scattered QTEs through its cutscenes with little thematic justification. Press A to jump a gap. Press X to dodge. Press B to punch. The buttons were semi-random; the consequences were mostly "you died and have to watch the cutscene again." The mechanic added nothing. It merely signaled distrust that the player was paying attention.

The Walking Dead (Telltale, 2012) and its successors used QTEs for action beats, and the same pattern emerged: the QTEs were often close to guessing games, the windows were tight, and failing them was punishing without being interesting. Telltale's narrative strengths lay elsewhere; the QTEs were a weakness.

The general failure mode of QTEs is that they are anxiety-generators rather than engagement-generators. The player is not empowered by being asked to hit a button; they are nervous that they might miss it and be forced to rewatch. The QTE becomes a filter between the player and the scene, rather than a bridge. If your QTE is essentially "press this button fast or die," you have added friction without agency. Consider whether the same moment could work as either pure cutscene (trust the player) or full gameplay (give them real control). The middle ground is usually the worst option.

A Practitioner's Rule for QTEs

If you are going to use QTEs, follow three rules. First, the button should thematically correspond to the action — press the grip button for a grip, the attack button for an attack, not an arbitrary mapping. Second, the window should be generous (at least 1 second for a single press). Third, failure should be interesting — the scene continues differently, the character takes damage, the outcome branches — not "restart from the last checkpoint."

If you cannot follow all three, the QTE is probably not earning its place. Cut it. Make the moment either a cutscene or full gameplay.

The Metal Gear Solid Problem

Hideo Kojima's Metal Gear Solid series is the definitional case study in cutscenes. Every MGS game — MGS1, MGS2, MGS3, MGS4, MGSV — has prodigious cutscenes that many players love and that almost as many critics consider excessive. MGS4 contains a single cutscene that runs seventy-one minutes. The game's final cutscene-to-gameplay ratio is lopsided enough that jokes about "Metal Gear Solid: The Movie" have been a running cultural reference for two decades.

I want to be careful here. Kojima's cutscenes are not bad. They are a coherent artistic choice. They contain the series' thematic depth, its self-referential metatextual games, its political philosophy. Kojima wants to make cinema and games in the same work. When the cutscenes succeed — the opening of MGS3, the Psycho Mantis fight, the ending of MGS4 — they succeed at things games rarely achieve.

But the design lesson from the series is cautionary: when cutscenes expand to fill the space available, they can swallow the game. MGS4's seventy-one-minute cutscene is not a triumph but a warning. Whatever Kojima wanted to say in that scene, the player was not playing the game for over an hour. The pacing rhythm that makes games feel like games — play, cutscene, play, cutscene, play — collapses when the cutscenes are longer than the gameplay sequences between them.

For most designers, the MGS lesson is: unless your name is Kojima, you do not have the license to do this. And even Kojima has moved away from the extreme — MGSV is far lighter on cutscenes than MGS4, reportedly because Kojima himself wanted to correct course. The series' own auteur concluded the pendulum had swung too far.

💀 Design Autopsy: The MGS4 Problem. In 2008, Metal Gear Solid 4: Guns of the Patriots shipped with cutscenes totaling, depending on counting method, between eight and ten hours of non-interactive footage in a game whose main campaign runs fifteen to twenty hours. Players could watch nearly half the game without touching the controller. Individual scenes ran routinely past ten minutes. The final cutscene, at seventy-one minutes, was longer than many feature films. Fans celebrated the narrative density and the long-awaited answers to series questions. Critics — and later, many of those same fans, on replay — noted that the game's pacing collapsed under the cutscene weight. The gameplay, which was often brilliant, became a minority activity. Kojima's next major game (MGSV) explicitly reduced cutscene density; whatever the exact reason, the auteur who built the cutscene-maximalist style pulled back. The lesson: when cutscenes become the dominant activity, the game becomes a worse version of a movie and a worse version of a game simultaneously.

Respect for Player Time

Some rules in game design are universal. This is one of them: cutscenes must be skippable.

Every cutscene. No exceptions. Not the opening cutscene. Not the ending cutscene. Not the "important" one. All of them, skippable, by a clearly-marked button press.

The reasons are not subtle. The player who encounters the cutscene on their first playthrough may or may not want to watch it. If they do, they don't need the skip button. If they don't, forcing them to watch is hostile. The player who replays the game — and you want players to replay the game — will absolutely not want to watch the cutscene a second time. Forcing them to is cruel. The speedrunner who wants to break your game in creative ways wants to skip cutscenes as a matter of course. Denying them that is denying a whole community of players.

Twenty years ago, cutscenes were often unskippable because of technical limitations (streaming from slow media made skip difficult) or because designers thought their cutscenes were too important to be skipped. Both reasons are defunct. Modern hardware can skip any cutscene. And no cutscene is too important to be skippable — the player who skips it accepts the consequence of missing whatever it contained.

The defaults matter. Skip should require an explicit input (hold Start, or a similar non-accidental gesture) so the player does not skip by mistake. But the option must be present, and it must be unambiguous.

✅ Best Practice: Implement cutscene skipping in your engine at the platform level, not per-cutscene. A single skip system that any cutscene can opt into. Then make sure every cutscene opts in. If you are about to ship a cutscene as unskippable, you have probably made a mistake — go back, re-examine the decision, and make it skippable unless you have an extraordinary reason.

Should Cutscenes Be Replayable?

Yes, usually. A cutscene gallery, unlocked as the player progresses, lets interested players revisit the game's narrative highlights. This is standard in many genres (JRPGs have used it for decades) and costs little to implement. Make the gallery accessible from the main menu after the scenes are first watched.

Cutscene-to-Gameplay Transitions

The moment the cutscene ends and gameplay begins is a design opportunity that most games squander. Done well, the transition is seamless — the camera smoothly returns to gameplay control, the player is oriented, and they can act immediately. Done poorly, the transition is jarring — the camera snaps to a different angle, the player is disoriented, they take a second to figure out which direction is forward.

The seamless transition requires planning the cutscene's final camera pose to match the gameplay's starting camera pose. If the cutscene ends with the camera behind the player looking forward, and gameplay begins with the camera behind the player looking forward, the transition is invisible. If the cutscene ends with a wide profile shot and gameplay begins with an over-the-shoulder, the cut is hard.

The Naughty Dog games (again, the gold standard) plan these transitions carefully. Watch the end of any major The Last of Us cutscene: the final shot is almost always composed so that the cut to gameplay is minimal. Sometimes it happens mid-animation — the character is in the middle of a walk, the cutscene hands control over, and the player picks up exactly where the authored animation left off.

For smaller teams, the pragmatic rule is: end your cutscene with a camera pose reachable from your default gameplay camera, and let the player move their camera from there.

The Camera Takes Over

The inverse problem is getting into the cutscene smoothly. The player is running around, they enter a trigger zone, and — what happens?

The naive approach is a hard cut: the gameplay camera vanishes, the cutscene camera appears, the cutscene begins. This is jarring. The player was controlling the camera; now something else is.

The better approach is a smooth handoff: the gameplay camera eases toward the cutscene's starting pose over a short interpolation (0.3-0.5 seconds), then the cutscene's camera takes over. The player sees the camera move as if under authored control, accepts that something is happening, and the cutscene begins without disruption.

Some games use a "camera takes over" signal — a subtle UI element, a hint of letterboxing, a change in music — to indicate that input is no longer needed. This gives the player a clean moment to put the controller down (if they want to) or keep watching actively.

When you design a cutscene that triggers on room entry — where the player might be running, possibly in combat, at an unpredictable camera angle — plan four things. First, the camera handoff: interpolate from current to cutscene camera over 0.3-0.5 seconds. Second, the character pose: snap or blend the character into the cutscene's starting pose. Third, gameplay state: pause enemies, clear projectiles, silence combat music. Fourth, input lockout: disable player input the instant the trigger fires, so a stray button press doesn't fire during the opening frame.

Length Guidelines

Here are practical numbers from observation and production experience.

Under 10 seconds. A beat. A reaction shot. A quick establishing moment. These do not meaningfully interrupt flow; the player barely registers that control was taken. Use these liberally when they serve a beat.

10-30 seconds. A scene. A short conversation, a dramatic moment, a small reveal. This is the sweet spot for most in-game cutscenes. The player can comfortably watch without fatigue; the designer can stage a genuine moment. Default to this range.

30-90 seconds. A long scene. A major dialogue, a significant plot beat, a character confrontation. Acceptable at major turning points but should not be common. Every one of these should be load-bearing — if you could cut it, you should.

90 seconds to 5 minutes. A cinematic. Reserved for the biggest moments in the game. Opening, major act breaks, climactic confrontations, the ending. You should have only a handful of these across the entire game.

Over 5 minutes. Approaching MGS territory. Requires an extraordinary justification. Most designers should never cross this line. If you are consistently shipping five-plus-minute cutscenes, your game is drifting toward film and away from play.

Over 20 minutes. This is an MGS joke. Do not do this. If you believe your game needs a twenty-minute cutscene, the correct response is to redesign the moment — split it across gameplay interludes, cut it in half, or rework the structure. If Kojima cannot reliably pull off a twenty-minute cutscene, neither can you.

⚡ Quick Reference: Cutscene length guidelines. - Under 10s: Free. Use as needed. - 10-30s: Default range for most scenes. - 30-90s: Reserved for major beats. - 90s-5min: Rare. Only the game's biggest moments. - 5-20min: Almost never justified. - 20min+: Do not. Redesign the moment.

Camera Language in Cutscenes

When the cutscene begins, you are directing a film. Film grammar applies. This is a full skill in itself, but a few rules carry most of the weight.

Establishing shots come first. Before you cut to a close-up of a character, show where they are. The viewer's mental map of the scene is built from wide shots; close-ups live inside that map.

The 180-degree rule. When two characters face each other, the camera should stay on one side of the imaginary line between them. Crossing the line disorients the viewer — characters appear to swap positions. Either stay on one side, or cross deliberately with a transition shot (a neutral angle) in between.

Cut on motion. Cuts that happen during visible movement are invisible; cuts from a static shot to a static shot are jarring. Edit your scene so cuts happen mid-gesture, not mid-pause.

Eye lines should match. If character A is looking screen-right at character B, the cut to character B should show them looking screen-left back. Mismatched eye lines make the conversation feel wrong without the viewer knowing why.

Shot variety. A conversation shot entirely in a single wide shot feels flat. A conversation shot entirely in alternating close-ups feels frantic. Mix wide, medium, close, and insert shots. Let the editing rhythm breathe.

Cutscenes that feel amateurish usually violate these rules. Cutscenes that feel professional usually follow them without the viewer noticing.

🎓 Advanced: Advanced cutscene cinematography includes shot lengths tuned to emotional beat, color temperature shifting across a scene to signal mood change, depth-of-field pulls to direct attention, and "invisible" editing that follows the actor's gesture even across cuts. Sergei Eisenstein's theory of montage and contemporary screenwriter Blake Snyder's Save the Cat both offer vocabulary worth adopting. Game directors like Cory Barlog (God of War) and Neil Druckmann (The Last of Us) have spoken publicly about adopting professional cinematography practices for cutscene work. Study their talks.

The Seamless One-Shot Camera

A specific technique bears its own section because it has reshaped how the industry thinks about cutscenes in the last ten years: the seamless one-shot camera.

The idea: the camera never cuts. From the moment the player starts the game until they finish the final credits, the camera is a single continuous take. No cuts between gameplay and cutscene. No cuts between scenes. No black frames, no loading transitions that break the continuity. The player never sees a cut.

This sounds insane. It is, in some respects, insane. But it also works — brilliantly — in God of War (2018), Cory Barlog's reboot of the series. The game commits to the no-cuts philosophy across twenty-plus hours of gameplay and cutscenes, and the commitment transforms the player's relationship to Kratos. You are with him, continuously, without the editorial breathing room that cuts provide. When something traumatic happens, you cannot look away — the camera does not cut away. When something beautiful happens, you are in it.

Case Study 23.1 covers God of War 2018 in detail. The technique is not for every game. But the philosophical lesson applies beyond Barlog's specific implementation: think about cuts as deliberate interventions rather than editorial necessities. Every cut is a choice. A cutscene that enters and exits without cuts tells the player something different than a cutscene that begins with a fade to black and ends with a fade back. Consider what you want to say.

Environmental Storytelling as Alternative

Many moments that designers reach for cutscenes to deliver can be delivered instead through environmental storytelling (Chapter 22). Before you write a cutscene, ask: could this moment be shown through the environment? Through a corpse's pose, a journal entry, a room's arrangement?

Dark Souls delivers almost all its world's backstory through environmental and item-description storytelling. There are cutscenes — a handful, at key moments — but the bulk of the world is built through implication. The result feels deeper and more discoverable than an equivalent amount of cutscene exposition would.

Even when you decide a cutscene is the right tool, environmental storytelling can shorten it. A cutscene that opens with the player already in a ruined room — ruins you staged in the environment — does not need to establish the state of the room. The environment does that work. The cutscene can get to the emotional beat faster.

Cutscenes and environmental storytelling are complementary tools. Environmental storytelling shows the before-and-after; cutscenes depict the during. A ruined city (environment) plus a cutscene depicting the moment of ruin (scene) is a stronger narrative combination than either alone. Plan them together.

Audio in Cutscenes

Music and sound design are often where the cutscene lives or dies. A cutscene with weak audio feels like a student film — silent gaps, level mismatches, a score that neither underscores nor retreats. A cutscene with strong audio feels intentional, weighted, deliberately composed.

Three rules carry most of the weight.

The score should not be the scene. A cutscene needs music that supports the scene's emotional arc, but the music should serve the scene, not the other way around. A common amateur mistake is to score the cutscene at maximum intensity from the first frame, which leaves nowhere to go. Save your crescendos. Let the music start quietly and swell when the beat calls for it. If every cue is the same volume, the audio mix flattens and the viewer stops perceiving it.

Dialogue must be mixed above ambient sound. This is a technical concern but a critical one. In gameplay, the player often has volume sliders that favor sound effects; in cutscenes, dialogue must be clearly audible above the music and ambient noise. Target dialogue sitting 6-10 decibels above the score during speaking moments. Many games ship with cutscene audio that's inaudible during quiet lines — the music and ambient drown out the voice. Test this ruthlessly, especially with headphones versus speakers versus TV setups.

Sound design fills silence. A cutscene without ambient sound feels artificial. If two characters are talking in a forest, the forest should quietly exist — distant wind, bird calls, insects. If they're on a spaceship bridge, the hum of computers, occasional alerts, ambient crew chatter. Silence is a tool, but it must be used deliberately. Unintentional silence reads as budget.

Beyond these rules, consider the transition. The audio at the moment of cutscene entry should bridge — the gameplay ambient should fade as the cutscene score rises, not cut abruptly. Same for exit: the score should fade as gameplay ambient returns. A seamless audio transition contributes to a seamless visual transition.

Subtitles are not optional. Always include them. Make them on by default or clearly offer them in the first-run setup. Players with hearing impairments, non-native speakers, players in noisy environments, players who turned the volume down because someone is asleep in the next room — all rely on subtitles. This is table stakes, not a bonus feature.

Audio checklist for any cutscene: dialogue mixed 6-10 dB above score during speech; score dynamics built across the scene (quiet start, build to beat); ambient sound present even in quiet scenes; audio transitions (into and out of the cutscene) smooth, not hard-cut; subtitles implemented and on by default; tested on headphones, speakers, and TV.

Dynamic State in Cutscenes

A subtle but consequential design choice: how much of the game's dynamic state carries into cutscenes?

In the simplest case, the cutscene ignores gameplay state. The character in the cutscene wears the default outfit regardless of what armor the player equipped. The characters present are the ones the writer specified, regardless of which companions the player brought along. The cutscene is a fixed authored moment.

In the most responsive case, the cutscene adapts. The character wears the armor the player is currently wearing. The companion who speaks is the one the player actually brought. If the player made a choice three chapters ago, the cutscene reflects it — different dialogue, different characters, different outcomes.

The tradeoff is authorial cost against player recognition. Fixed cutscenes are cheap to produce. Adaptive cutscenes are expensive but deeply rewarding when they hit — the moment when the player realizes this cutscene is different because of a choice I made is one of the most satisfying experiences games can offer.

Most games choose a middle path. Major cutscenes are fixed, produced to a final polish. Minor conversations and transitional moments adapt to player state. The player sees their character's current outfit in a fireside scene; the climactic boss cutscene is authored without regard to outfit. This is the compromise that shipped Mass Effect, The Witcher 3, Baldur's Gate 3, and most contemporary narrative games.

Baldur's Gate 3 (2023) pushes adaptive cutscenes further than most. The cast of any given scene depends on who you brought. Romance scenes adapt to your chosen partner, your character's race, even your character's appearance. This is enormously expensive — Larian's production estimates include thousands of hours of motion-capture for variant scenes — but it is part of why the game feels like your story rather than a story you happened to play.

For a prototype, choose the simpler path. Fix the cutscene's cast and state. Reward yourself for shipping a cutscene that works at all. Adaptive cutscenes are a later-stage feature, built on top of a stable authored foundation.

If you want to see adaptive cutscenes in action, play the opening hour of Baldur's Gate 3 twice with radically different characters — say, a small elven rogue and a large half-orc barbarian. Observe how the cutscenes adapt: the companion reactions, the camera framing, the dialogue variants. Then note the cutscenes that don't adapt — the ones where the character is fixed or the scene is pre-authored without variation. The pattern reveals where Larian spent their cutscene budget and where they chose not to.

Implementing a Basic Cutscene Player

Here is a small, working cutscene player for Godot. It uses AnimationPlayer to drive camera movement and character position over a fixed timeline, with a signal emitted at the end so gameplay can resume.

# CutscenePlayer.gd
# Attach to a Node that has an AnimationPlayer child and references
# to the camera and characters to be animated.
extends Node

signal cutscene_started
signal cutscene_finished

@export var animation_player: AnimationPlayer
@export var cutscene_animation: String = "intro_scene"
@export var gameplay_camera: Camera3D
@export var cutscene_camera: Camera3D

var _is_playing: bool = false

func play_cutscene() -> void:
    if _is_playing:
        return
    _is_playing = true
    cutscene_camera.current = true
    get_tree().paused = false
    emit_signal("cutscene_started")
    animation_player.play(cutscene_animation)
    await animation_player.animation_finished
    _finish()

func skip_cutscene() -> void:
    if _is_playing:
        animation_player.stop()
        _finish()

func _finish() -> void:
    _is_playing = false
    gameplay_camera.current = true
    emit_signal("cutscene_finished")

func _input(event: InputEvent) -> void:
    if _is_playing and event.is_action_pressed("ui_cancel"):
        skip_cutscene()

Twenty-five lines of GDScript. The work lives in the AnimationPlayer (authored in the editor) where you key camera position, camera rotation, character position, and any other properties over time. The script's job is small: start the animation, swap to the cutscene camera, listen for skip input, and fire a signal when done.

A more sophisticated cutscene player would support multiple animation tracks (for multi-character scenes), a dialogue UI that advances with the animation, and transitions in and out of the cutscene camera. The pattern, though, is the pattern: an animation timeline drives property changes, a camera takeover is tracked explicitly, and the system emits a completion signal so the rest of the game knows when control returns.

Usage, in your level script:

# When the player enters the trigger
func _on_cutscene_trigger_body_entered(body: Node3D) -> void:
    if body.is_in_group("player"):
        $CutscenePlayer.cutscene_finished.connect(_on_cutscene_done)
        $CutscenePlayer.play_cutscene()

func _on_cutscene_done() -> void:
    # Resume gameplay, enable player input, etc.
    $Player.set_process_input(true)

This is a minimal implementation. Production cutscene systems get more elaborate — support for subtitles, audio sync, letterboxing UI, seamless camera blend, dynamic lookup of characters so the cutscene adapts to which characters are present, and so on. For your prototype, the 25-line version is sufficient. When you need more, extend this base rather than rebuilding from scratch.

Should Cutscenes Be Replayable?

Yes — when it's cheap to support. Offer a "cutscene gallery" in the main menu that unlocks scenes as the player reaches them. This lets players revisit the narrative without replaying the whole game. Make it optional; not every game needs it.

For first-party release, aim to implement the gallery early so you don't ship without it. It is a small feature that dedicated fans will remember.

Project Checkpoint

At this point in the book, you should have a small working prototype with gameplay loop, dialogue, and the beginnings of environmental storytelling. This chapter asks you to add one or two brief in-engine cutscenes at key narrative moments.

📐 Project Checkpoint: Add 1-2 brief in-engine cutscenes to your prototype, at key narrative moments you have identified. Keep each under 30 seconds. Use the CutscenePlayer.gd pattern from this chapter. Ensure every cutscene is skippable. After implementing, playtest — watch at least one playtester play through each cutscene. Do they watch it fully? Do they skip it? Does the transition in and out feel intentional? Iterate based on what you see. Resist the urge to extend the cutscenes past 30 seconds; the prototype's job is to demonstrate that the tool is integrated, not to deliver a cinematic showpiece.

Deciding When You Actually Need a Cutscene

We close this chapter with a practical decision framework. Before you build any cutscene, ask these five questions.

  1. What specific beat does this cutscene land? Name it in one sentence. "Ellie learns Joel lied to her." "The player sees the city destroyed." "Kratos meets his father for the first time." If you cannot name the beat, the cutscene does not have a clear purpose.

  2. Could gameplay deliver this beat? Be honest. Could a controllable scene, even a constrained one, achieve the same effect? If yes, strongly prefer gameplay.

  3. Could environmental storytelling deliver it? Could the scene's outcome be implied by a room's state, a corpse's position, a note left behind, rather than a scene depicting the outcome?

  4. How long will this cutscene be? Target under 30 seconds. If you are drafting something longer, ask what you can cut. The first cut you make will hurt; the second will feel fine; by the third, the scene is tighter and better.

  5. Will this cutscene be skippable? The answer is yes. If you feel the answer should be no, you are wrong. Make it skippable.

If you can answer all five honestly and the cutscene still earns its place, build it. If any question gives you pause, redesign the moment. The discipline of saying no to cutscenes is what separates designers who use cutscenes well from designers who reach for them whenever the writing gets hard.

Think of one scene in a game you love that you believe is perfect. Now ask: could it have been delivered without a cutscene? How? What would the gameplay version look like? Would it be better, worse, or simply different? There is no right answer — the point is to develop the habit of questioning whether any given moment needs the cutscene treatment. Some moments will survive the questioning; some will not. The designers whose cutscenes consistently land are the ones who have done this questioning on every moment they shipped.

The Close of Part V

This chapter ends Part V. Across four chapters, we have built a full narrative toolkit: the question of what stories games tell (Chapter 20), the craft of writing dialogue and characters (Chapter 21), the discipline of environmental storytelling (Chapter 22), and now the careful use of cutscenes (Chapter 23).

You have four distinct tools for conveying narrative: gameplay (what the player does), dialogue (what characters say), environment (what the world shows), and cinema (what the camera authors). Each is appropriate for different beats. Each fails in different ways. The designer's job is to choose, moment by moment, which tool fits.

When the tool is gameplay, use gameplay. This is your strongest channel. Players care most about what they do.

When the tool is dialogue, write economically. Characters should speak the minimum necessary to land the beat.

When the tool is environment, trust the player to read. The world tells its own story to those who look.

When the tool is cinema, use it briefly. Take control away for specific beats. Give it back as soon as the beat lands. Respect the player's time.

The games that integrate these four tools gracefully are the ones whose narratives people remember. The Last of Us. Red Dead Redemption 2. Dark Souls. Portal. God of War. Not because they used any one tool brilliantly, but because they used all four, and knew when to switch between them.

Part VI turns to art direction and visual design — the craft of how your game looks and feels. The narrative you have now planned will land differently depending on the world's aesthetics. A melancholy story in a cheerful art style creates one effect; the same story in a grim palette creates another. The next part is about making that choice deliberately.

🚪 Threshold Concept: Cutscenes are not part of the game. They are interruptions to the game. This inversion of perspective — treating the cutscene as an unwelcome guest that must justify its stay, rather than as a storytelling tool you naturally reach for — is the single most important shift in cutscene design. Many designers pass through this threshold and never think about cutscenes the same way again. On the other side, you add cutscenes reluctantly, cut them aggressively, and end up with fewer but better cinematic moments. Before you close this chapter, sit with three questions. First: In the game you are currently prototyping, how many cutscenes are you planning? If the answer is more than three or four, ask whether you have earned each one. Second: Of those cutscenes, how many are truly under thirty seconds? The ones over thirty seconds are the ones most likely to need a redesign. Third: Have you ever been frustrated, as a player, by a cutscene you could not skip? What made that frustration memorable? Design with that memory active. Cross the threshold.

A Final Exercise

Before the exercises chapter proper, one last thought experiment. Imagine a version of your prototype with zero cutscenes. Every narrative beat must be delivered through gameplay, dialogue, or environment. What would that version look like? Where would the story weaken? Where would it strengthen? Where would you miss the cutscene tool most?

This exercise is not an argument for zero cutscenes (though some great games have none). It is an argument for earning the cutscenes you include. If your game would be 85% as good with no cutscenes, your cutscenes are doing 15% of the narrative work. Make sure they do more than that. Make each one count.

When you are ready, continue to the exercises, which will have you analyze cutscenes in three games, storyboard one for your prototype, implement the player, and critique a famous example. By the time you finish Part V, your narrative toolkit is complete.

Turn the page.