Press the shutter on a modern phone in a dim restaurant and something strange happens. There is a pause —
Prerequisites
- 2
- 3
- 4
- 5
- 22
Learning Objectives
- Explain how a modern phone computes a single photograph from a burst of many frames, and name the stages of that pipeline.
- Describe what HDR and tone mapping do to a scene's dynamic range, and recognize when the result is faithful versus overcooked.
- Explain how night mode aligns and stacks frames to beat noise in the dark, and predict when it will fail.
- Explain how portrait mode builds a depth map and applies synthetic bokeh, and spot the artifacts that give it away.
- Identify the computational color, sharpening, and segmentation steps that produce the modern 'phone look,' and dial them back when they lie.
- Decide, scene by scene, when to trust a computational mode and when to override it with manual or RAW capture.
In This Chapter
- Overview
- Learning Paths
- 23.1 From one frame to many: multi-frame capture and stacking
- 23.2 HDR and tone mapping
- 23.3 Night mode: alignment, stacking, and noise
- 23.4 Portrait mode: depth maps and synthetic bokeh
- 23.5 Computational color, sharpening, and the "phone look"
- 23.6 When computation helps and when it lies
- Portfolio Checkpoint
- Summary
- Spaced Review
- What's Next
Chapter 23: Computational Photography: How Night Mode, Portrait Mode, and HDR Actually Work
"The best camera is the one that's with you." — attributed to Chase Jarvis
Overview
Press the shutter on a modern phone in a dim restaurant and something strange happens. There is a pause — half a second, maybe a full one — and then an image appears that is brighter, cleaner, and more detailed than the scene your eye is looking at. The shadows under the table are open. The candle flame on the table is not blown out. The faces are sharp and noise-free. It looks, frankly, better than it should. You did not take that photograph. Your phone computed it, from a burst of frames you never saw, fused and cleaned and tone-mapped by software in the time it took the screen to flash.
This is the defining fact of capture in this decade, and it is the subject of this chapter. The phone in Chapter 22 was a real camera with real limits — a tiny sensor, a fixed-ish lens, a small bucket for light. Computational photography is the set of tricks that work around those limits not with better glass but with more frames and more math. It is why a sensor the size of a fingernail can produce images that, ten years ago, would have required a camera you carried in two hands. And it is why, sometimes, the same phone produces an image that is subtly, uncannily wrong — a haloed edge around someone's hair, a sky that never looked that blue, a face smoothed into plastic.
The whole point of opening this black box is judgment. Once you understand what your phone is actually doing when it stacks, fuses, segments, and tone-maps, you stop being a passenger. You learn the situations where computation rescues a shot you could not otherwise get, and the situations where it quietly invents detail, flattens a mood you wanted, or mangles an edge — so you can turn it off, shoot RAW, and do the work yourself. The camera got smart. This chapter is how you stay smarter than it.
In this chapter, you will learn to:
- Understand the single biggest idea in modern phone photography: a photograph is increasingly computed from many frames, not recorded in one.
- Read what HDR and tone mapping do to a scene's range of brightness, and tell a faithful HDR from an overcooked one at a glance.
- Explain how night mode beats darkness by aligning and averaging a burst, and predict exactly when it will smear, ghost, or fail.
- See how portrait mode builds a depth map and fakes the soft background a fast lens would give you — and catch the artifacts that reveal the fake.
- Recognize the computational color, sharpening, and segmentation that create the instantly familiar "phone look," and dial them back when they lie about the scene.
- Make the override decision deliberately: when to trust the mode, when to lock exposure and shoot it straight, and when to capture RAW and compute it yourself later.
Learning Paths
📱 Mobile-only: This chapter is the heart of your craft — your camera is a computational camera, and every section here is about driving it with understanding instead of hope. Read all of it. Pay special attention to §23.6, where the override decisions live. 🎨 Hobbyist: Even if you shoot a dedicated camera, your phone is your always-present second body, and increasingly your "real" camera runs versions of these tricks too (in-camera HDR, multi-shot modes). The conceptual sections (§23.1–§23.4) transfer directly; §23.5 explains why phone files often need less editing and sometimes more. 💼 Pro-track: Clients will hand you phone files and ask why they "look like a phone." Knowing exactly which computational step did what — and how to defeat it by shooting RAW (§23.6) — is a professional competency now. The 🔬 The Physics boxes are worth your time. 🎓 Student: §23.1's pipeline and the three Physics sidebars (stacking, tone mapping, depth estimation) are the assessable core. The Portfolio Checkpoint asks you to document what computation added and what it faked — a critical-seeing exercise as much as a shooting one.
23.1 From one frame to many: multi-frame capture and stacking
Start with the scene that makes the whole chapter necessary. You are indoors, late afternoon, photographing a friend who is sitting beside a bright window. Your eye sees both of them clearly and the street outside the glass. But the scene contains an enormous range of brightness — the sunlit street is many, many times brighter than your friend's shadowed face. A single exposure cannot hold both. Expose for the face and the window blows out to featureless white. Expose for the window and the face goes to mud. This is the oldest problem in photography, and for a hundred years the only answers were to add light, accept the loss, or pick your battle.
A computational camera answers differently. Instead of taking one frame, it takes several — some darker, some brighter, captured in a rapid burst — and then combines them into a single image that holds detail the way no one frame could. That combination is the foundational move of this whole field. When the frames are merged to reduce noise, we call it stacking; when they are merged to extend the range of brightness, we call it HDR (§23.2); when they are aligned and averaged in the dark, we call it night mode (§23.3). All of them rest on the same idea: more frames, intelligently combined, beat one frame every time.
Let us name the field and the core technique precisely. Computational photography — which you met in Chapter 22 as the reason a phone punches above its sensor — is photography in which the final image is produced by software combining and processing data from one or more captures, rather than by a single exposure recorded as-is. Image stacking is the specific technique of capturing multiple frames of the same scene and merging them into one, pixel by pixel, to achieve something a single frame cannot: less noise, more dynamic range, more depth of field, or sharper detail.
🔗 Connection: This is not new with phones. In Chapter 14 you stacked frames for focus (focus stacking, to get a bug sharp from antenna to tail), and in Chapter 21 you effectively stacked time with a long exposure to smooth water and draw light trails. Computational stacking on a phone does the same kind of thing — combine many captures into one better one — but it does it automatically, in under a second, and aligns the frames itself.
Why many frames beat one: the noise problem
The single most important reason to stack is noise — the random speckle, the gritty texture, that infests photos taken in low light or at high ISO (you met it as the cost of cranking ISO in Chapter 3). Here is the intuition, and it is worth holding onto because it explains night mode, HDR shadows, and half of what your phone does after dark.
Imagine the light landing on one pixel of your sensor. Light arrives in discrete little packets (photons), and in dim conditions only a few land during the exposure — and crucially, the number that arrive flickers randomly from frame to frame, even pointed at the same unchanging wall. That random flicker is the noise. The real signal — the actual brightness of the wall — is steady. So if you capture the same wall many times and average the frames, the random flicker partly cancels itself out (it is high one frame, low the next), while the steady signal reinforces. The wall gets cleaner; the speckle melts away.
FIGURE 23.1 — Why stacking cleans up noise (one pixel, many frames)
A single dim frame: signal ≈ 100, but noise jitters the reading wildly
frame 1 reads 118 ████████████ (too high)
frame 2 reads 86 ████████▌ (too low)
frame 3 reads 104 ██████████▌
frame 4 reads 92 █████████▏
frame 5 reads 110 ███████████
...the true value is 100, but any ONE frame is off by a lot
Average of 5 frames ≈ 102 ──► much closer to the true 100, far smoother
Average of 20 frames ≈ 100 ──► the speckle is nearly gone
The SIGNAL (real brightness) adds up; the NOISE (random jitter) averages toward zero.
More frames → cleaner image. This is the engine inside night mode (§23.3).
That is the entire engine of multi-frame capture, stated in one figure. The signal is consistent across frames, so it survives averaging; the noise is random, so it shrinks. Stack four frames and the noise drops noticeably. Stack sixteen and a grainy mess becomes a usable photo. Phones exploit this constantly — quietly, even in "normal" mode, your phone is often capturing and merging a short burst every time you press the shutter. The pause you feel after the click in dim light is the stack being captured and fused.
How to shoot it: get out of the camera's way
For most of this chapter the "how to shoot" is less about new buttons and more about enabling the machinery to work and then judging the result. Three habits make every multi-frame mode succeed:
- Hold still through the pause. The frames must be aligned before they are merged. Phones are astonishingly good at aligning small, fast handheld wobble — but they cannot align what moves too far or too fast. A braced grip, elbows tucked, breathing held for the half-second of capture, gives the alignment step an easy job. (Chapter 22's bracing and Chapter 21's "be a tripod" carry straight over.)
- Tap to set exposure before you press. The phone decides how many frames and how dark/bright to make them based on where you tell it the subject is. Tap your subject; if needed, drag the exposure slider so the important thing looks right before capture. You are aiming the computation, not just the focus.
- Let it finish. Moving the phone away the instant the screen flashes can cut a stack short or leave it un-merged. Wait for the thumbnail to settle. It takes a moment; the moment is the point.
⚠️ Common Mistake: Jerking the phone away the instant you hear the shutter sound. In bright light this is harmless — the capture was a single fast frame. But in dim light, HDR, or night mode, the shutter sound is the start of a multi-frame capture that runs for a noticeable fraction of a second (or several seconds in night mode). Move early and you get a smeared, half-merged, or ghosted result, then blame the phone. Watch the capture indicator; hold until it's done. The single biggest improvement most people can make to their computational shots is simply waiting.
💡 Why It Works: Stacking turns time into quality. A bigger sensor gathers more light per frame in space; a stack gathers more light across frames in time. Both end up with more total signal relative to noise — which is the real currency of image quality. This is why a tiny phone sensor, given a second to capture and combine a dozen frames, can rival a much larger sensor's single frame. It "borrows" the light it lacks in area by spending time instead. The catch — and the theme of this whole chapter — is that borrowing across time only works when the scene holds still long enough to be borrowed from.
🔄 Check Your Eye: 1. In one sentence, why does averaging many frames of the same still scene reduce noise but not erase the subject? 2. You photograph a dim room handheld and the result is sharp and clean; you photograph a passing cyclist in the same dim room and they come out smeared. Same phone, same light — what changed?
Answers
- The subject's brightness is the same in every frame, so it reinforces when averaged; the noise is random each frame, so it partly cancels. Signal adds, noise averages toward zero. 2. The room held still across the burst, so its frames aligned and stacked cleanly; the cyclist moved between frames, so there was no consistent signal to reinforce — motion breaks the core assumption that the scene is the same in every frame. Computation can't average something that isn't there twice.
23.2 HDR and tone mapping
Return to your friend by the bright window. The problem was dynamic range — the distance, measured in stops, between the darkest and brightest parts of a scene you want to keep. A sunlit-window-and-shadowed-face scene might span twelve or more stops. A phone sensor in a single exposure can cleanly hold maybe nine or ten. The scene is bigger than the bucket. Something has to clip — go pure black or pure white with no detail.
The computational answer is HDR, short for high dynamic range. The phone captures multiple exposures of the same scene — typically one metered for the shadows (a brighter exposure that opens the dark face) and one or more metered for the highlights (darker exposures that preserve the window) — and merges them, taking the good shadow detail from the bright frames and the good highlight detail from the dark frames. The result is a single image that contains more range than any one capture: face and window, both with detail.
But there is a second, subtler step that matters more for how the photo looks. The merged HDR data may contain twelve stops of range, but your phone screen, a print, and the human comfort zone all want something nearer eight or nine stops. So the phone must compress that wide range back down to fit a normal display — deciding how to map the scene's full brightness onto the limited brightness a screen can show. That compression step is tone mapping: the process of remapping a wide range of captured brightness into the narrower range a display or print can reproduce, deciding which tones to lift, which to hold, and how hard to compress the extremes.
Tone mapping is where HDR earns its love and its hate. Done with restraint, it gives you what your eye saw: open shadows, controlled highlights, a natural-feeling photo. Done aggressively, it produces the infamous "HDR look" — flat, grey, over-detailed images where the shadows and highlights are crushed toward the middle, local contrast is jacked up until every texture screams, skies turn an unnatural slate, and the whole thing has the queasy, internally-lit quality of a video game. The data was fine; the mapping was heavy-handed.
FIGURE 23.2 — Single exposure vs. HDR merge vs. tone mapping (a backlit window scene)
THE SCENE spans ~12 stops: [shadowed face] ........................ [sunlit window]
dark bright
ONE EXPOSURE, metered for the face: face OK, window CLIPS to white
░░▓▓██ face | WINDOW = ▒▒▒▒▒▒ (blown, no detail) ████████ → white wall
ONE EXPOSURE, metered for the window: window OK, face CLIPS to black
▓▓ face (mud) | window = ░░▒▒▓▓ (sky, frame, detail all there)
HDR MERGE takes the best of each: face from the bright frame + window from the dark frame
░░▓▓ face (detail!) | window ░░▒▒▓▓ (detail!) — now 12 stops of REAL range captured
TONE MAP fits 12 stops into a ~9-stop display:
GENTLE → looks like your eye saw it: open shadows, held highlights, natural
HEAVY → the "HDR look": flat grey, halos at edges, over-textured, fake-bright
The MERGE recovers the range. The TONE MAP decides whether the photo looks real or cooked.
How to shoot HDR well
The good news: on most phones, HDR is automatic and usually tasteful, and the merge is genuinely useful. Your job is to recognize when it helps, aim it, and judge whether the tone mapping went too far.
- Use it for high-contrast scenes: backlit subjects, interiors with bright windows, landscapes where the sky is far brighter than the land, anything where you'd otherwise have to "pick your battle" between shadows and highlights. This is HDR's home turf and where it rescues otherwise-impossible shots.
- Tap for the highlights, let HDR rescue the shadows. Counterintuitively, in a backlit scene you often get the most natural HDR by setting exposure for the bright area (so highlights aren't blown beyond recovery) and trusting the merge to lift the shadows. Blown highlights have no data to recover; dark shadows usually do.
- Watch for the halo. The tell-tale sign of heavy-handed tone mapping is a soft glow or dark fringe along high-contrast edges — a pale halo around a dark roofline against a bright sky, for instance. If you see it, the processing overreached.
- If your phone offers "natural" vs. "rich/vivid" HDR, prefer natural for scenes you want to look real, and reserve the punchy version for graphic, deliberately-stylized images.
⚠️ Common Mistake: Over-trusting auto-HDR for mood shots. HDR's instinct is to open every shadow and recover every highlight — to make everything visible. But many of the best photographs depend on things not being visible: a moody low-key portrait where the shadow side falls to near-black, a silhouette against a sunset, a single shaft of light in a dark room. Auto-HDR will "fix" the very darkness that made the image work, flattening drama into documentation. When you want shadows to stay shadows, turn HDR off (or shoot a single, deliberately-dark exposure) and protect the mood. Range is not always the goal; sometimes the absence of range is the photograph. (This connects straight back to the low-key work in Chapter 8.)
🔬 The Physics: (Optional — skip without penalty.) What does a tone-mapping algorithm actually decide? Picture the merged HDR image as a tall stack of brightness values, far taller than a screen can show. A global tone curve squashes that stack uniformly — like turning down contrast — which is safe but can leave the midtones muddy. Local tone mapping is smarter and riskier: it divides the image into regions and brightens or darkens each according to its neighborhood, so a dark face can be lifted without also washing out the bright window beside it. The power of local tone mapping — and its danger — is that it manipulates local contrast. Push it and you get the over-detailed "every brick pops" HDR look, plus the halos where a bright region and a dark region meet (the algorithm can't decide how much to lift each side, so it leaves a gradient — the halo). Modern phone HDR is mostly local tone mapping, tuned by the manufacturer's taste. That "taste" is why two phones photograph the same sunset and produce visibly different files: same physics, different mapping decisions. None of this is required to use HDR well — just to understand why heavy HDR looks the way it does.
📸 In the Field — HDR on and off, same scene. Find a high-contrast scene you can shoot from a fixed spot: a window-lit room with the bright outdoors visible, or a building's dark side against a bright sky (the city block or your kitchen window location both work). Shoot it three ways: (1) HDR off, exposed for the highlights; (2) HDR off, exposed for the shadows; (3) HDR on. Look at all three together. Where did the single exposures clip? What did HDR recover — and does the tone mapping look natural or cooked? Keep the most honest one and one sentence on why. You are training the judgment to know when to let the computation work.
23.3 Night mode: alignment, stacking, and noise
Now the most spectacular trick your phone does, and the one that most often looks like magic: night mode. A scene so dark your eye can barely navigate it — a dim street, a candlelit table, a room lit by one lamp — produces, after a few seconds, a bright, clean, detailed photograph. People consistently underestimate how much computation this takes, and consistently misuse it because they don't understand its one fatal weakness.
Night mode is the noise-stacking idea from §23.1 turned up to maximum and combined with clever alignment. Here is the sequence, which is worth knowing because every limitation falls out of it:
FIGURE 23.3 — What night mode actually does (a few seconds of work, step by step)
1. CAPTURE a burst of many short frames over 1–6 seconds (sometimes longer on a tripod).
┌─┐┌─┐┌─┐┌─┐┌─┐┌─┐┌─┐┌─┐ each frame is dark and noisy on its own
└─┘└─┘└─┘└─┘└─┘└─┘└─┘└─┘
2. ALIGN every frame to a reference, correcting your handheld wobble between frames.
the phone finds matching detail and shifts each frame into register → ▣▣▣▣ stacked true
3. REJECT or repair the parts that moved (a passing car, a waving hand) so they don't ghost.
moving regions are detected and dropped/blended → fewer frames there, but no smear (usually)
4. MERGE the aligned frames: average them to cancel noise (§23.1), summing the light.
8 dark noisy frames → 1 brighter, far cleaner frame
5. TONE-MAP and finish: lift the result to a viewable brightness, manage highlights (streetlights!),
apply color and sharpening (§23.5).
→ the final bright, clean night photo
The magic is steps 2–4. The failure is always in step 3: things that MOVED.
Walk the steps. The phone captures a burst over a second or several — not one long exposure (which would blur from your hand shake and over-brighten the streetlights) but many short ones. It then aligns them, correcting the small drift of your hands between frames so that the stationary world lands on the same pixels in every frame. It averages the aligned frames, which (per §23.1) cancels the noise and sums the dim light into a usable signal. And it tone-maps the bright, clean result down to something that looks like a photograph rather than a foggy grey rectangle. The reason it beats a single long exposure is that short frames freeze your hand-shake, and aligning-then-averaging lets the phone combine them as if you'd held perfectly still — which no human can do for two seconds.
How to shoot night mode
- Brace harder than feels necessary, for longer than feels necessary. The capture runs for seconds. The better you imitate a tripod, the more frames the phone trusts and the longer an exposure it will attempt, which means a cleaner result. Lean on a wall, set the phone on a ledge, prop it against a railing. A three-dollar mini-tripod transforms night mode (it will often extend to much longer captures when it detects it's stable).
- Tell your subject to freeze. This is the whole game (see below). Night mode of a person requires them to hold still for the entire capture — brief, but real. "Hold still… got it" is a night-mode skill.
- Tap to expose for what matters, and watch the highlights. Bright point sources — streetlights, a neon sign, the moon — can still blow out even as night mode opens the shadows. Often you want to drag exposure down a touch so the lit sign keeps its detail and color.
- Don't expect it everywhere. Most phones only offer (or auto-trigger) night mode below a brightness threshold. In merely dim light it may not engage; that's fine — a short stack in normal mode is doing similar work invisibly.
⚠️ Common Mistake: Using night mode on anything that moves — and blaming the blur on the dark. Night mode's fatal weakness is motion. Stationary scene: miraculous. Moving subject: smear, ghost, or a person rendered as a translucent double. People shoot a friend laughing at a night party in night mode, get a blurred face, and conclude their phone is "bad in low light." The phone is extraordinary in low light — on things that hold still. The face blurred because it moved across a multi-second capture. The fix is either (a) have the subject freeze for the capture, (b) add a little light (a nearby sign, a friend's phone flashlight) so the phone uses a shorter night-mode capture, or (c) accept that for fast night action you may need a single brighter exposure and the noise that comes with it. Match the tool to the motion.
🖼️ Read This Frame: Here is night mode doing exactly what it's for — and a note on where it would have failed. This is set in our recurring city block at night location.
text FIGURE 23.4 — "The quiet corner, after midnight" [constructed teaching example] THE FRAME A narrow side street recedes into the upper-right third. In the foreground left, a closed café's window glows faintly; wet cobblestones fill the lower frame, holding soft reflections of a single warm streetlamp. No people. A bicycle leans, unmoving, against a wall mid-frame. THE LIGHT One warm sodium streetlamp upper-center, plus the cool spill from the café window. The contrast between the two color temperatures is the whole mood. Deep shadow between them, now opened by night mode to reveal brickwork and a doorway that the eye, standing there, could barely make out. THE MOMENT Stillness itself — chosen precisely *because* nothing moves. The empty street is what makes night mode succeed: every one of the dozen frames is identical, so they stack perfectly. THE CHOICES Phone braced on a mailbox for the ~3-second capture. Exposure dragged down half a step so the streetlamp keeps its warm glow instead of blowing to a white blob. Framed low to use the wet-stone reflections as foreground. Shot in night mode; no tripod, just a steady ledge. THE EFFECT The eye enters on the bright café window, follows the reflections down the wet stones, travels up the street into the opened shadow. It reads as a calm, cinematic, *seen* scene — far more than the murk that was actually visible to the naked eye. THE LESSON Night mode's gift is the *still* scene: it gives you a clean, low-noise image of darkness that no single handheld frame could. Its price is paid the instant anything moves. Shoot the empty street, the lit building, the landscape — the patient subjects — and night mode looks like sorcery. Put a running figure in it and the sorcery breaks.🔬 The Physics: (Optional.) Why short frames instead of one long one? Two reasons, both about motion. First, your motion: a single 3-second handheld exposure records every tremor of your hands as blur; a burst of 30 frames at a tenth of a second each freezes the tremor in every frame, and alignment removes the drift between frames — so you get the light of a 3-second exposure with the sharpness of a fast one. Second, the highlights: a long exposure keeps piling light onto the streetlamp until it blows out to a white disc, while short frames let the algorithm hold the bright point and lift only the dark surroundings. The cost is computational: aligning and merging dozens of frames, detecting and rejecting the regions that moved (step 3 in Figure 23.3) so a passing car becomes a faint ghost or vanishes rather than smearing across the frame. The "rejection" step is also why night-mode shots of moving things look partially there — the algorithm found motion, dropped some frames for that region, and had less data to work with exactly where the action was. None of this needs to be in your head to press the button — but it tells you precisely why the empty street works and the moving subject doesn't.
🔄 Check Your Eye: 1. Why does night mode capture many short frames instead of one long exposure? 2. Your night-mode shot of a building is gorgeous, but the friend standing in front of it is a blurry ghost. Without changing phones, name two ways to fix the friend.
Answers
- Many short frames freeze your hand-shake (each frame is sharp) and let the phone align and average them to cancel noise and sum the light — combining them into the equivalent of a long exposure but without the motion blur a true long handheld exposure would suffer; it also lets the algorithm protect bright highlights instead of letting them pile up and blow out. 2. (a) Have the friend hold completely still for the whole capture; (b) add light on them (step into a streetlight's pool, have someone shine a phone light) so the phone uses a shorter night-mode capture they can hold through; or (c) shoot a single brighter exposure and accept some noise. The building held still and stacked perfectly; the friend moved.
23.4 Portrait mode: depth maps and synthetic bokeh
Of all the computational tricks, portrait mode is the one that most clearly fakes an optical effect — and understanding the fake is the only way to use it without getting caught. In Chapter 4 you learned that a fast lens with a wide aperture (say f/1.8 on a full-frame camera) throws the background into a soft, creamy blur while the subject stays razor-sharp — the quality called bokeh (you met it in Chapter 4), and the look most associated with "professional" portraits. A phone's tiny sensor and small lens physically cannot produce much of that blur; everything is more or less in focus, which is why straight phone portraits look "flat" and amateur next to a camera shot.
Portrait mode fakes it. Instead of producing the blur optically, the phone measures (or estimates) how far away each part of the scene is, builds a map of those distances, and then digitally blurs the parts it decides are "background" while keeping the "subject" sharp — synthesizing the look a fast lens would give without the lens. Two new ideas make this work, and both are worth defining because both are where it goes wrong.
A depth map is a second, invisible image the phone computes alongside the photo: instead of recording color at each pixel, it records distance — how far that pixel's subject is from the camera. Think of it as a grayscale picture where near things are bright and far things are dark (or vice versa). With a depth map in hand, the phone knows what is "close" (the face) and what is "far" (the wall behind), and can treat them differently.
Synthetic bokeh is the result: a digitally simulated background blur applied according to the depth map, imitating the out-of-focus rendering of a fast lens. The phone keeps the near depths sharp and progressively blurs the far ones, often even simulating the soft discs that out-of-focus highlights make in a real lens. When it works, a phone portrait suddenly has the subject-pops-from-background look that used to require an expensive lens. When it fails, you get the artifacts every trained eye now recognizes.
FIGURE 23.5 — How portrait mode fakes shallow depth of field
REAL OPTICS (a fast lens): focus plane is physical; blur is continuous and true
[CAM] f/1.8 ──► ( face SHARP ) ········ background dissolves smoothly by distance
PHONE COMPUTES IT INSTEAD:
1. ESTIMATE DEPTH (build a depth map):
near = white, far = black
░░░░░░░░░░░░░░░░░░░░░░░░
░░░░░░░██████░░░░░░░░░░░ ← the white blob is the detected SUBJECT (the face/body)
░░░░░░██████████░░░░░░░░
░░░░░░░████████░░░░░░░░░░ everything dark = "background," to be blurred
░░░░░░░░░░░░░░░░░░░░░░░░
2. CUT OUT the subject along that map's edge (this is the hard part — see hair, glasses)
3. BLUR everything marked "far," keep everything marked "near" sharp
4. Optionally render fake bokeh discs where the background had bright points
RESULT: a phone image that LOOKS like f/1.8 — until the depth map gets the EDGE wrong.
Where the depth map comes from
Phones estimate depth in a few ways, and which one your phone uses shapes how well it works:
- Two cameras (stereo). Two lenses a centimeter apart see the scene from slightly different angles; comparing the two, the phone triangulates distance — the same way your two eyes judge depth. Near things shift more between the two views than far things do.
- Dedicated depth sensors. Some phones project a pattern of infrared dots or measure the time light takes to bounce back (time-of-flight), building a genuine 3-D measurement of the scene. More accurate, especially in low light.
- Machine-learned single-image depth. Increasingly, phones guess depth from a single image using a trained model that has learned what near and far tend to look like. Astonishing when it works; prone to confident mistakes (it can be fooled by a photo of a scene, or by an unfamiliar shape).
Most phones blend these. The key practitioner fact is simply: the depth map is an estimate, and the blur is only as good as that estimate. Where the estimate is confident and the edge is clean — a person's shoulder against a far wall — portrait mode is convincing. Where the estimate struggles — fine hair, eyeglasses, a mug's handle, chain-link fence, a subject and background at similar distance — the seams show.
The artifacts that give it away
Train your eye on these; once you see them you cannot unsee them, and you'll catch them in your own work and everyone else's:
- The hair halo / haircut. Flyaway strands of hair are the hardest thing to segment. Portrait mode often either blurs them away (an unnaturally clean haircut) or leaves a sharp fringe of hair against an already- blurred background (a halo). Curly or textured hair, and hair against a busy background, suffer most.
- The cutout edge. Look along the subject's outline against the blurred background. A too-perfect, almost cut-with-scissors transition — sharp subject, instantly blurred background, no gradual falloff — betrays the digital mask. Real optical blur eases in with distance; the fake often snaps.
- Wrong things blurred or kept. A coffee cup the subject holds, an arm reaching forward, the gap between an elbow and the body — these confuse the depth map. You'll see a hand the same distance as the face get blurred, or a chunk of background near the head stay weirdly sharp.
- Glasses and transparent things. The depth map struggles with what it can see through. Lenses, bottles, windows, and glassware produce odd partial blurs.
- Uniform too-much blur. Real bokeh increases gradually with distance — a wall two metres back is softly blurred, a tree fifty metres back is a wash. Synthetic bokeh often blurs everything "background" by the same heavy amount, flattening the depth it was trying to fake.
⚠️ Common Mistake: Maxing the blur and shooting busy backgrounds. Two errors compound here. First, cranking the simulated aperture to its blurriest setting exaggerates every segmentation error — the hair halo, the cutout edge, the wrongly-blurred hand all get worse. A moderate blur is more convincing than a maximal one. Second, portrait mode struggles most when the background is close and cluttered (a bookshelf right behind the subject, a chain-link fence) because there's little real depth difference to read and lots of edges to mis-segment. The fix is the same one a real lens would want: put distance between your subject and the background. Step them forward, away from the wall. More real depth gives the algorithm an easy map, and the synthetic blur becomes believable. Computation works best when you've already given it a scene that's easy to read.
💡 Why It Works: The deep reason portrait mode exists is purely physical and you met it in Chapter 4: background blur depends on the physical aperture diameter and the sensor size, and a phone's are tiny, so a phone genuinely cannot blur a background much optically. The computation isn't cheating around a software limit — it's substituting math for an optics limit that can't be removed without a bigger camera. That's also why it will never be perfect: it's reconstructing 3-D information (distance at every pixel) from a flat image plus a little stereo or sensor data, and reconstruction can only estimate. A real lens doesn't estimate depth — it simply is at one focus distance, and physics blurs the rest exactly right.
♿ Accessibility & Inclusion: Two notes where portrait mode meets real people. First, depth-from-image models have historically been trained on uneven data and can segment some skin tones, hair textures, and face shapes less well than others — coily and curly hair in particular is often mangled at the edges. If a portrait of someone is producing a bad halo, it may not be "their fault" or yours: it can be a limitation in how the model was trained. The reliable fix is real distance to the background plus moderate (not maximal) blur, which eases the segmentation. Second, when you write alt text for a portrait-mode image (Chapter 1's Described-Photograph skill again), describe the person and the moment, not the simulated blur — the blur is a style choice, the human is the content.
📸 In the Field — Test the depth map's limits. In your kitchen window location, shoot a friend (or a stuffed toy, a houseplant, a mannequin head) in portrait mode under conditions designed to break it: (1) against a wall right behind them, then again with them stepped three metres forward; (2) wearing glasses, then without; (3) holding a mug out toward the camera; (4) with messy, flyaway hair if possible. Shoot 10–12 frames across these. Then zoom in on the edges and find every artifact — the haloed hair, the blurred hand, the snapped cutout. Keep the two best (where the fake is convincing) and the two worst (where it fails), and write one sentence each on why the depth map succeeded or failed. This is the single fastest way to learn to trust — and distrust — portrait mode.
🔄 Check Your Eye: 1. What is a depth map, and why does portrait mode need one? 2. Name three subjects or situations where synthetic bokeh predictably fails at the edges.
Answers
- A depth map is a computed image recording distance (not color) at each pixel — how far each part of the scene is from the camera. Portrait mode needs it to decide what counts as "subject" (keep sharp) and what counts as "background" (blur), so it can simulate shallow depth of field. 2. Any three of: fine or flyaway hair (esp. curly/coily or against a busy background); eyeglasses and transparent objects; a hand/cup reaching toward the camera at a different distance than the face; a subject close to a cluttered background with little real depth difference; chain-link or fine repeating patterns.
23.5 Computational color, sharpening, and the "phone look"
You have probably noticed that you can often tell a photo was shot on a phone — even a good one — without being told. There is a look: bright, punchy, contrasty, very sharp, vivid skies, sometimes a faint plasticky smoothness to faces. That look is not the sensor. It is a stack of computational decisions applied after capture, every time, by the phone's image processing, and most of them you can't see being made. To shoot with judgment you need to know they're happening, because each one can quietly lie about the scene.
Here is the assembly line that runs on every frame, even your "normal" snapshots:
FIGURE 23.6 — The "phone look" pipeline (applied to nearly every frame, invisibly)
RAW sensor data (flat, dull, slightly noisy, what the chip actually saw)
│
▼
[ DEMOSAIC ] reconstruct full color from the sensor's color-filter mosaic (Ch.2's Bayer array)
│
▼
[ STACK / HDR / NIGHT ] the multi-frame merges from §23.1–23.3, if engaged
│
▼
[ WHITE BALANCE + COLOR ] auto-neutralize, then push toward pleasing: bluer skies, greener grass,
│ warmer skin — a "memory color" boost, not what was literally there
▼
[ LOCAL TONE + CONTRAST ] open shadows, add punch, lift local contrast (§23.2's tone mapping)
│
▼
[ SHARPENING ] exaggerate edges so the image reads "crisp"; aggressive on a small screen
│
▼
[ NOISE REDUCTION ] smooth the speckle — which also smooths fine texture (skin, foliage, hair)
│
▼
[ SEGMENTATION-AWARE TWEAKS ] detect SKY → boost blue; detect FACE → smooth + brighten; detect
│ FOLIAGE → boost green; detect FOOD → warm + saturate (§ below)
▼
The familiar, processed JPEG/HEIC — vivid, sharp, "phone-looking"
Three of these stages deserve a closer look because they're where the phone most often departs from reality.
Computational color. The phone doesn't just neutralize the white balance (Chapter 5); it pushes colors toward what people remember and prefer rather than what was measured — the so-called memory-color boost. Skies get bluer, grass greener, sunsets more saturated, skin a touch warmer and smoother. It usually looks great. But it means your phone's rendering of a scene is an opinion, not a record — which matters when you need color to be accurate (documenting a product's true color, matching a paint sample, a forensic record) or when the real scene's subtle, desaturated color was the point and the phone has overruled it.
Sharpening. Phones sharpen aggressively because sharp images look impressive on a small bright screen. Edges are exaggerated; fine detail is crisped. The cost: over-sharpening produces a hard, brittle, sometimes "crunchy" look, halos along high-contrast edges, and an exaggeration of any noise. Skies and smooth gradients can show faint outlines. It's tuned for the phone screen, not for a large print, where the over-sharpening becomes obvious.
Noise reduction — and the "watercolor" smear. To kill the speckle from a small sensor (especially in low light), phones smooth aggressively. The trouble: the algorithm can't always tell noise from fine texture, so it smooths both. Push it and skin turns waxy and plastic, foliage dissolves into green mush, hair loses its strands, distant brickwork smears into a painterly blur. This "watercolor effect" is the dead giveaway of heavy noise reduction, and it's why phone photos can look detailed at a glance and fall apart when you zoom in.
Semantic segmentation: the phone knows what it's looking at
The newest layer, and the one that most changes the game, is semantic segmentation: the phone identifies what different regions of the image are — this region is sky, that is a face, that is foliage, that is food — and then processes each region differently and automatically. It doesn't just adjust the photo; it adjusts the sky and the skin and the grass with separate, tuned recipes.
This is why the modern phone look is so consistent and so seductive: the sky is always that ideal blue, faces are always brightened and smoothed, food is always warm and appetizing, foliage always lush — because the phone detected each and applied the crowd-pleasing treatment. It's genuinely clever. It's also where computation most confidently invents: it can brighten a sky that was meant to be ominous, smooth a face you wanted weathered and characterful, saturate food past what was on the plate. The phone made an editorial decision about your photograph's meaning, based on a category, without asking.
🔬 The Physics: (Optional.) Semantic segmentation runs on the same kind of trained model as the machine-learned depth estimation in §23.4 — a network that has learned, from millions of labeled images, what "sky" and "skin" and "grass" look like, and outputs a per-pixel label map (the semantic cousin of the depth map). Once the phone has that label map, it applies region-specific adjustments: a sky mask gets a blue/contrast curve, a skin mask gets a smoothing-and-warming curve, and so on. The same machinery powers the sky-replacement and "magic eraser" tools you'll meet conceptually in Chapter 33 — segment a region, then synthesize or swap it. The practitioner consequence is profound: your phone is no longer adjusting brightness and color generically; it is making content-aware changes based on what it thinks objects are — and a wrong guess (mistaking a blue wall for sky, a painting for a real scene, a sculpture for a face) produces a confident, wrong edit. This is the bridge from "computational photography" to "the camera has opinions about your subject."
⚠️ Common Mistake: Believing the phone's JPEG is "what the camera saw." It is not — it's the camera's heavily-processed interpretation, after color boosting, tone mapping, sharpening, noise reduction, and segmentation-aware tweaks. Treating it as ground truth leads to two errors: you stop noticing how far the rendering has drifted from the real scene, and you have no way back when the processing went wrong (a face smoothed to plastic, a sky over-blued, texture smeared). The fix is to shoot RAW when accuracy or control matters (next section): a RAW file is the sensor data before most of this pipeline, giving you the real scene to develop yourself. Shoot the processed JPEG for speed and sharing; shoot RAW when you need the truth and the control.
🔄 Check Your Eye: 1. Name the pipeline stage responsible for each: skin that looks waxy and plastic; a sky that's bluer than it really was; a "crunchy," over-edged look. 2. What is semantic segmentation, and why does it make phone photos look so consistent?
Answers
- Waxy plastic skin → noise reduction (over-smoothing, the "watercolor" effect, sometimes plus face-smoothing from segmentation); bluer-than-real sky → computational color / memory-color boost (and a sky-detecting segmentation tweak); crunchy over-edged look → sharpening (over-applied). 2. Semantic segmentation is the phone identifying what each region is (sky, face, foliage, food) and applying a separate tuned treatment to each. It makes phone photos consistent because the same flattering recipe is applied to every detected sky, every face, every plate — so they all converge on the same crowd-pleasing look.
23.6 When computation helps and when it lies
You now know the machinery. This final section is the judgment that the machinery is for: a working decision procedure for when to trust the computation, when to override it, and when to take the controls entirely by shooting RAW. This is the section to screenshot.
When computation genuinely helps — let it work
The modes are not gimmicks. In their home territory they do things no single frame and no amount of manual skill can match on a phone:
- High-contrast scenes → HDR. Backlit subjects, bright-window interiors, landscapes with bright skies. The merge recovers range you physically cannot hold in one frame. Trust it; just watch the tone mapping.
- Static low-light scenes → night mode. Empty streets, lit buildings, landscapes, still life after dark, the moon over a quiet harbor. The stacking gives you clean images of darkness that are otherwise impossible handheld. Trust it; just brace and watch the highlights.
- Subject-against-distant-background portraits → portrait mode. A person well separated from a far, simple background, moderate blur. The synthetic bokeh sells the look. Trust it; just give it real distance and don't max the blur.
- Casual sharing, good light, no time → the full auto pipeline. When the light is good and you just want a pleasing, shareable image fast, the processed JPEG is excellent. Let the phone cook.
When computation lies — override it
Each mode has scenes it actively damages. Recognize them and take back control:
FIGURE 23.7 — The override decision table
SCENE / GOAL COMPUTATION'S INSTINCT WHAT TO DO INSTEAD
──────────────────────────────────────────────────────────────────────────────────────────
Moody low-key shot, dark on purpose HDR opens every shadow Turn HDR OFF; one dark exposure;
(silhouette, single light, noir) → flattens the drama protect the shadows (Ch.8)
──────────────────────────────────────────────────────────────────────────────────────────
Moving subject in the dark Night mode smears/ghosts it Subject FREEZES, or add light, or
(person, car, dancer at night) → blurry double single brighter exposure + noise
──────────────────────────────────────────────────────────────────────────────────────────
Hair/glasses/cup/cluttered bg Portrait mode mis-segments Add real distance; MODERATE blur;
(the hard portrait) → halos, snapped cutout or skip fake blur entirely
──────────────────────────────────────────────────────────────────────────────────────────
Accurate color needed Memory-color boost Shoot RAW; set white balance
(product, art, document, match) → bluer/warmer than real yourself; trust the data not the JPEG
──────────────────────────────────────────────────────────────────────────────────────────
Fine texture matters Noise reduction smears it Shoot RAW; control NR in edit;
(skin character, foliage, fabric) → waxy "watercolor" more light / lower ISO at capture
──────────────────────────────────────────────────────────────────────────────────────────
Maximum quality / you'll edit later Pipeline "bakes in" choices Shoot RAW; YOU make the decisions
(a keeper, a print, a pro job) → irreversible JPEG in the digital darkroom (Part VI)
The master override: shoot RAW
The single most powerful way to take control back from the computational pipeline is to capture a RAW file — the relatively unprocessed sensor data, before demosaicing-to-taste, before the color boost, the tone mapping, the sharpening, the noise reduction, the segmentation tweaks (you met RAW vs. JPEG in Chapter 2, and you'll develop RAW deliberately in Chapter 26). A phone RAW won't have the magic auto-HDR baked in — but it gives you the real scene, with all the latitude to make every one of those decisions yourself, in post.
The trade-off is exactly what you'd expect:
- The phone's computed JPEG/HEIC: gorgeous out of the camera, the HDR and night-mode magic included, but the choices are baked in and partly irreversible, and they may have lied (over-blued sky, plastic skin, flattened mood). Best for speed, sharing, and good-light snaps.
- RAW: flat and unimpressive straight off the sensor, no auto-magic, larger files, requires you to develop it — but it's the truth, fully malleable, with the most highlight and shadow latitude and your decisions about color, tone, and texture. Best for keepers, accuracy, difficult light, and anything you'll print or edit seriously.
- The pro move — shoot both. Many phones can save the processed JPEG and the RAW simultaneously. Keep the JPEG for the moment you want to share now; keep the RAW for when you want to do it properly. You lose nothing but storage.
🚪 Threshold Concept: Your phone is no longer recording the scene — it is rendering an interpretation of it. Once you truly absorb this, your relationship to phone photography changes permanently. Every phone image is a negotiation between the photons that arrived and the manufacturer's idea of a pleasing picture, executed by software that stacks, fuses, blurs, boosts, smooths, and segments — often beautifully, sometimes dishonestly. The processed image is not a window onto reality; it is an opinion about reality. Knowing this doesn't make you cynical about the computation — it makes you its director instead of its passenger. You let it work where it serves your vision, and you override it (with manual exposure, with HDR off, with RAW) where its opinion isn't yours. That is the whole skill of computational photography: not the tricks, but the judgment of when to let the machine decide and when to decide yourself.
🎒 Gear Note: You do not need a flagship phone to practice everything in this chapter — but capability does vary, and here's the principle (not a model list). Three things separate a basic computational camera from an advanced one: (1) how many frames it captures and how cleverly it aligns/rejects them (better night mode, fewer ghosts); (2) whether it has dedicated depth hardware (more reliable portrait edges) versus pure machine-learned depth; and (3) whether it lets you shoot RAW and offers a manual/pro mode (the override). If you're choosing a phone for photography, the RAW-and-manual capability matters far more than another tenth of a millimetre of sensor. And the phone-only reader lacking RAW still has every conceptual tool here: brace for the stack, expose for the highlights, give portrait mode distance, and turn HDR off for mood. Judgment, not hardware, is the real upgrade.
🎞️ Behind the Image: (A constructed but representative vignette.) A photographer shooting a documentary series in a dim community kitchen kept getting faces that looked, in her words, "like mannequins" — smooth, bright, characterless, the texture and warmth of real working hands sanded away. She assumed the low light was the problem and fought it with more lamps, and the faces got worse, not better: brighter light gave the noise reduction and face-smoothing even cleaner data to plastic-ify. The fix was the opposite of more gear. She switched the phone to RAW, accepted a flatter file on the screen, and developed each frame herself in the mobile darkroom (Chapter 25) — dialing the noise reduction down, keeping the lines and pores and the honest, lived-in skin. The series suddenly looked like the people it was about. The lesson she repeats: when a phone makes everything too perfect, the problem usually isn't the light — it's the computation, and the answer is to take the controls.
🔄 Check Your Eye: 1. Give two distinct scenes where you should turn a computational mode off, and which mode. 2. What does shooting RAW give back to you that the computed JPEG takes away — and what does it cost?
Answers
- Any two of: a moody low-key/silhouette image → turn HDR off (it would open the shadows you want dark); a moving subject in the dark → don't rely on night mode (it ghosts motion); a portrait with hard edges (hair/glasses) against clutter → skip portrait mode or give it distance; an accurate-color or fine-texture job → bypass the auto JPEG pipeline by shooting RAW. 2. RAW gives back the real, unprocessed sensor data — full control over color, tone, highlight/shadow latitude, sharpening, and noise reduction, none of it baked in or lied about. It costs: a flat unimpressive straight-off-the-sensor look, larger files, the loss of the automatic HDR/night magic, and the requirement that you develop it.
Portfolio Checkpoint
Throughout this book you are building a curated Photography Portfolio of twenty to thirty images. This chapter adds a computational experiment — a frame that proves you understand both the power and the deception of the modern phone.
The assignment, in one sentence: Reshoot a tricky scene — one that genuinely defeats a single ordinary exposure — using a computational mode (Night, HDR, or Portrait), and document what the computation added and what it faked.
Pick a scene that needs the help: a backlit window interior (HDR), a dim street or lit building after dark (Night), or a portrait that needs subject-background separation a phone can't give optically (Portrait). First, shoot it the "honest" way — a single normal exposure with the mode off — so you have a baseline of what the scene really looks like to the sensor. Then shoot it with the computational mode engaged, doing everything this chapter taught: brace through the capture, expose for the highlights, give portrait mode real distance. Keep the best computational frame.
Why this image belongs: Your portfolio already shows you can capture light, moment, frame, and focus deliberately. This frame shows something subtler and very modern — that you can make a computational camera serve your vision rather than impose its own, and that you can see through its tricks. That critical literacy is a genuine skill and a marker of a thinking photographer, not just a button-pusher.
The curation note — and this is the real point: Beside this image in your running list, write two short lines. First: what the computation added (range you couldn't hold; noise it cleaned; a blur a phone lens can't make). Second: what it faked or got wrong (a halo on the hair; a tone-mapped sky that's too flat; a face it smoothed; a snapped cutout edge; a color it pushed). If the experiment produced an image good enough to earn a portfolio slot, keep it as a keeper; if it mainly taught you where computation lies, keep it as a documented study and note what you'd do differently. Either way, you now hold a frame that demonstrates you understand the most important shift in how photographs are made this decade.
Summary
This chapter opened the black box. The settings are mostly automatic; the judgment about when to trust them is the skill.
- A modern phone computes a photograph from many frames; it does not record one. Image stacking — capturing multiple frames and merging them pixel-by-pixel — is the foundational move. The signal (real brightness) reinforces across frames; the noise (random) averages toward zero, so more frames mean a cleaner, higher-range, or sharper image. The pause after the click is the stack being captured and fused; hold still and let it finish.
- HDR captures multiple exposures and merges them to hold a scene's full dynamic range (face and bright window), then tone-maps — compressing that wide range down to fit a display. Gentle tone mapping looks like your eye saw it; heavy tone mapping is the flat, grey, over-detailed, haloed "HDR look." Use HDR for high-contrast scenes; turn it off when you want shadows to stay dark (mood, silhouette, low-key).
- Night mode captures a burst of short frames, aligns them (correcting handheld drift), rejects what moved, and averages them to cancel noise and sum the dim light — then tone-maps the bright, clean result. Its gift is the static scene; its fatal flaw is motion (smear/ghost). Brace hard, watch the highlights, and have any subject freeze for the capture.
- Portrait mode fakes shallow depth of field a small phone lens physically can't produce: it builds a depth map (distance at each pixel, from stereo, a depth sensor, and/or machine learning), then applies synthetic bokeh — digitally blurring the "far" depths. It convinces when the edge is easy (subject far from a simple background, moderate blur) and betrays itself at hard edges: hair halos, snapped cutouts, wrongly-blurred hands, glasses. Give it real distance; don't max the blur.
- The "phone look" is a pipeline, not a sensor: demosaic → multi-frame merge → memory-color boost (bluer skies, warmer skin) → local tone/contrast → aggressive sharpening → noise reduction (which smears fine texture into "watercolor") → semantic segmentation (the phone identifies sky/face/foliage/food and processes each with a tuned recipe). The processed JPEG is an interpretation, not a record.
- When computation lies, override it — turn HDR off for mood, don't rely on night mode for motion, skip or distance portrait mode for hard edges, and above all shoot RAW when you need accurate color, fine texture, or full control. RAW is flat and unmagical out of the camera but gives back the real scene and every decision; shoot RAW + JPEG to lose nothing.
The one-line rule: Let the computation work where it serves your vision; override it (HDR off, manual exposure, or RAW) where its opinion isn't yours.
Spaced Review
Test yourself on earlier chapters without scrolling back — this is how the material sticks.
- (Chapter 22) Why does a phone's small sensor struggle in low light in the first place, and name two non-computational ways you learned to help it (before this chapter's stacking tricks).
- (Chapter 2) What is the difference between a RAW file and a JPEG, in terms of what each holds — and why does that difference matter so much for overriding the computational pipeline?
- (Chapter 22) What does locking exposure (exposure lock) let you do that tapping-to-focus alone does not — and why is aiming the exposure before capture essential to getting a good HDR or night-mode result?
Answers
1. A small sensor gathers little light per frame (small photo-sites, small total area), so in dim light the signal is weak relative to noise and images get grainy. Non-computational helps from Chapter 22: brace/ stabilize to allow a slower shutter without shake; add light (a window, a lamp, even a friend's flashlight); use the lowest usable ISO and a wider aperture; rest the phone on a steady surface. 2. A RAW file holds the relatively unprocessed sensor data with far more tonal latitude and no baked-in color/ contrast/sharpening; a JPEG holds an 8-bit, already-processed, compressed *interpretation* with the camera's choices locked in. RAW matters for override because it sits *before* the computational pipeline (color boost, tone map, sharpening, noise reduction, segmentation), so you can make all those decisions yourself instead of accepting the phone's. 3. Exposure lock fixes the brightness so it won't drift as you recompose or as something moves through the frame, letting you deliberately set exposure for the part of the scene that matters. Aiming exposure before capture is essential because the phone decides *how* to stack/merge (how many frames, how dark/bright) based on where you've told it the subject and the correct brightness are — set it wrong and the whole multi-frame computation optimizes for the wrong thing.What's Next
You have now captured images in every condition this book can throw at you — bright and dark, still and moving, optical and computed. Chapter 24 takes the camera (and you) on the road. Travel and aerial photography are about telling the story of a place: the establishing-to-detail sequence that makes a viewer feel they've been somewhere, the respect and permission that photographing unfamiliar people and cultures demands, packing a kit light enough to actually carry, and — when you add a drone — an entire new axis of view, along with the law, safety, and ethics that come with the sky. The computational fluency you built here travels with you: the phone that stacks and fuses is the one camera you'll never leave at the hotel.