> "If your pictures aren't good enough, you're not close enough."
Prerequisites
- 1
- 3
- 4
- 5
- 6
Learning Objectives
- Choose camera height deliberately — worm's-eye, eye-level, or bird's-eye — and predict the change in meaning each creates.
- Distinguish perspective (a product of distance) from focal length, and use the move-and-zoom relationship to compress or expand a scene on purpose.
- Control the visual relationship between a subject and its background by changing angle and position rather than by changing the subject.
- Work a scene by moving to subtract — circling, crouching, climbing, and closing in until the frame holds only what matters.
- Recognize wide-angle distortion and foreshortening, and decide in each case whether to exploit them or correct them.
- Treat point of view as a voice — a consistent, recognizable stance toward your subjects — rather than as an accident of where you happened to stand.
In This Chapter
- Overview
- Learning Paths
- 9.1 Camera height: worm's-eye, eye-level, and bird's-eye
- 9.2 Distance and focal length: compression versus expansion
- 9.3 Angle and the relationship between subject and background
- 9.4 Moving to subtract: working the scene
- 9.5 Perspective distortion: when to use it and when to correct it
- 9.6 Point of view as voice
- Portfolio Checkpoint
- Summary
- Spaced Review
- What's Next
Chapter 9: Perspective and Point of View: How Where You Stand Changes What the Photo Says
"If your pictures aren't good enough, you're not close enough." — attributed to Robert Capa
Overview
Stand a child and an adult in front of the same birthday cake and ask each to photograph it. The child's picture looks up at a towering monument of frosting against the ceiling; the adult's looks down at a flat disc on a table. Same cake, same room, same light — two completely different photographs, and the only thing that changed was the height of the lens. Neither of them touched a single setting. They just stood where they stood, and where they stood decided what the photograph said.
This is the most underused control in all of photography, and it is the one you were born with: your own two feet, and the vantage they choose. Most beginners photograph everything from exactly one place — adult eye level, a few feet back, lens pointed straight ahead — because that is where their body already is. They treat their standing height as a law of nature instead of a decision. And so every one of their photographs carries the same flat, default, "I-was-standing-there" point of view, no matter how varied the subjects are. The fix costs nothing. It is not a lens. It is the willingness to crouch, to climb, to lie down on the wet grass, to walk around to the other side, to take the three steps closer that feel awkward. The camera you already own can make a dozen completely different photographs of one subject. You just have to move.
This chapter is about that movement and what it does. We will separate two ideas that beginners constantly confuse — perspective (which comes from where you stand) and the look of a lens (which comes from focal length) — and show you that the most powerful tool for changing a photograph is not the zoom ring but your feet. We will look up and look down and learn what each says. We will learn to compress a scene until the mountains stack up behind the runner, and to expand it until a small room feels like a cathedral. And we will end where every technical chapter in this book ends: with the realization that all of this is in service of seeing — of building a recognizable way of looking at the world that is unmistakably yours.
In this chapter, you will learn to:
- Pick camera height on purpose — worm's-eye to make a subject loom, bird's-eye to map it flat, eye-level to meet it as an equal — and know what each one tells the viewer.
- Tell perspective and focal length apart, and use the fact that only moving changes perspective to compress a scene (telephoto, far back) or expand it (wide-angle, close in).
- Change the relationship between your subject and what's behind it by stepping left, right, up, or down — separating a subject from a busy background without moving the subject at all.
- "Work a scene" the way professionals do: never shoot one frame and leave, but circle, crouch, and close in, subtracting clutter by moving until only the picture remains.
- See wide-angle distortion and foreshortening for what they are, and decide — case by case — whether to lean into them for drama or correct them for honesty.
- Think of your point of view as a voice, the consistent stance that, over a body of work, makes your photographs recognizably yours.
Learning Paths
Where you stand is free, available to every camera, and therefore matters to every reader equally. Still, here is how to weight your attention:
📱 Mobile-only: This may be the single most important chapter in the book for you, because a phone hands you almost no optical controls — but it cannot stop you from moving. §9.1 (height) and §9.4 (working the scene by moving your feet) are where your phone competes with any camera on Earth. §9.2's compression is the one place your phone is genuinely limited; read the workaround there. 🎨 Hobbyist: §9.2 (compression vs. expansion) will explain a thing you have seen a thousand times and never had words for — why some photos "stack up" and others "open out." §9.3 and §9.5 will sharpen every portrait and landscape you make. 💼 Pro-track: §9.4 (working the scene) is a professional discipline, not a tip — it is the difference between coming home with one frame and coming home with the frame. §9.6 (point of view as voice) is what a client is really hiring when they hire you and not a stock library. 🎓 Student: Every section here is assessable through the Exercises and the Portfolio Checkpoint. The compression demonstration in §9.2 is the kind of thing that shows up on every exam; understand the move-and-zoom relationship cold.
9.1 Camera height: worm's-eye, eye-level, and bird's-eye
Before we touch focal length or distance or angle, start with the simplest variable of all, the one you change just by bending your knees: how high off the ground the camera is. We will call this the camera height — the vertical position of the lens relative to your subject. It is the first decision of vantage, it costs nothing, and it changes a photograph's meaning more reliably than almost anything else you can do.
Here is the principle in one sentence: the camera looks at a subject the way a person standing at that height would look at it, and the viewer feels whatever that look implies. Put the camera low and aim up, and the subject towers over the viewer — it becomes powerful, monumental, heroic, sometimes threatening. Put the camera high and aim down, and the subject shrinks beneath the viewer — it becomes small, vulnerable, mapped, sometimes pitiable. Put the camera level with the subject and you meet it as an equal, eye to eye, neither above nor below. None of this is arbitrary. It is wired into how we read the world: we look up at things bigger and more powerful than us and down at things smaller and weaker, and a photograph borrows that entire vocabulary the instant you choose a height.
Let's take the three positions in turn.
Eye level is the default — which is exactly its problem and its power. A photograph made at the subject's own eye level feels natural, neutral, and direct, because it is the height from which we ordinarily meet things. For a portrait of an adult, eye level means the lens at their eye height, and the result reads as a respectful, equal encounter — the standard for honest portraiture (we build the whole craft of it in Chapter 13). The trouble is that eye level is also where your body already is, so it is the height beginners use for everything by default, including the many subjects that would be transformed by a different one. Eye level is a fine choice. It is a terrible only choice.
Worm's-eye view makes the subject loom. Drop the camera low — to the ground, to knee height, to the pavement — and tilt it up, and ordinary things become monuments. A worm's-eye view (so called because it sees the world as a worm would, from the dirt looking up) puts the viewer beneath the subject. A toddler photographed from the floor looks up at us no longer; we look up at them, and a small child acquires the visual authority of a statue. A building shot from its base soars. A flower shot from below stands against the open sky instead of against a busy lawn. The low angle does two things at once: it lends the subject power, and — a bonus we will use constantly in §9.3 — it often swaps a cluttered ground-level background for a clean expanse of sky.
Bird's-eye view maps the subject and shrinks it. Raise the camera high — hold it overhead, climb a chair or a hillside, lean over a railing, shoot straight down at a table — and the world flattens into a plan. A bird's-eye view (the world as a bird would see it, from above looking down) turns three-dimensional scenes into two-dimensional patterns: a plate of food becomes a graphic arrangement of shapes and colors; a crowd becomes a texture; a city becomes a circuit board. Aimed straight down, it removes the horizon entirely and converts a scene into pure design — which is why the overhead "flat-lay" is the native language of food and product photography. Aimed down at a person, it makes them small and can feel tender, or diminishing, or godlike-observing, depending on the subject.
Here is the whole vocabulary in one diagram.
FIGURE 9.1 — Three camera heights, one subject (side view)
BIRD'S-EYE [CAM] ↘
(high, aiming down) \
\ the viewer looks DOWN ON the subject
\ → subject reads small, mapped, vulnerable
▼
EYE-LEVEL [CAM] ──────►( S ) the viewer meets the subject as an EQUAL
(level with → subject reads natural, direct, neutral
the subject) ▲
/
/ the viewer looks UP AT the subject
WORM'S-EYE [CAM] ↗ / → subject reads large, powerful, heroic
(low, aiming up) ______/
The subject ( S ) never moved. Only the height of the lens — and therefore
the viewer's implied position relative to it — changed. That alone re-wrote
what the photograph says.
The lesson of the diagram is the lesson of the whole chapter in miniature: the subject did not change, you did. That is the red-door refrain from Chapter 1, and camera height is one of its cleanest proofs.
How to shoot it. The technique is almost embarrassingly physical. For a worm's-eye view, get the camera low — actually low, not "a bit lower." Crouch, kneel, sit, or lie down. On a phone, turn the screen so you can still see it while the phone is at ankle height (flip to the front camera, or just shoot a few frames blind and check). Many cameras have a tilting screen precisely so you don't have to lie in the mud. For a bird's-eye view, get above the subject: stand on a stable chair, hold the camera overhead and tilt the screen down, climb the small rise at the edge of the park, or — for a flat-lay — stand directly over a table and aim straight down, keeping the sensor plane parallel to the tabletop so the rectangle of the table stays a rectangle. Then, and this is the actual assignment: shoot the same subject from all three heights, every time, until choosing height becomes a reflex instead of an afterthought.
⚠️ Common Mistake: Shooting everything from standing adult eye level. It is the most common composition error there is, and it is invisible because it feels like "just taking the picture." The tell is a whole camera roll where every photo was made from the same five-and-a-half-foot height a few steps back. The fix is a rule you can apply today: for any subject worth a photograph, make at least one frame from a height that is not your standing eye level. Crouch for the dog. Climb for the dinner table. Most of the time the unusual height will be the keeper.
💡 Why It Works: Low and high angles work because they recruit a meaning your viewer already carries in their body. From infancy we look up at parents, authorities, and tall buildings, and down at children, pets, and things on the floor. So "looking up at" is wired to bigger, more powerful, to be respected or feared, and "looking down at" to smaller, weaker, to be protected or dismissed. When you choose a camera height you are not just changing geometry; you are reaching into that pre-loaded vocabulary and telling the viewer how to feel about the subject before they have consciously thought a thing.
Now let's bring back the subject this whole book uses to prove that the photographer, not the camera, makes the picture: the weathered red door on the alley wall. You first met it in Chapter 1, exposed it in Chapter 3, saw it in golden versus flat light in Chapter 5, placed it on a power point in Chapter 6, studied it as red-against-grey in Chapter 7, and read its tones in black and white in Chapter 8. Here it is again — same door, same paint, same handle — photographed from the two extremes of camera height.
🖼️ Read This Frame: The red door, looked up at and looked down at.
text FIGURE 9.2 — "The red door, worm's-eye" [constructed teaching example] THE FRAME The camera sits on the cobblestones at the foot of the door, aimed steeply up. The door rears back away from us at the top, its vertical edges leaning together toward a vanishing point somewhere above the frame. The iron handle, normally at hand height, now hangs over us like a fixture on a tower. Above the door, a thin slice of pale sky and the underside of an overhanging gutter. THE LIGHT The same low side light from Chapter 5, but now skimming UP the door's surface from our low vantage, so the paint's ridges throw long shadows downward toward the lens — the texture reads even more violently than straight-on. THE MOMENT No action; the moment is again the raking light. But the low angle adds a sense that we are small, arriving at the foot of something larger than us. THE CHOICES Camera on the ground, lens tilted up ~30°; wide-ish framing so the leaning verticals are emphasized, not hidden; the cobblestones included in the bottom edge to anchor us at ground level; sky kept as a clean cap at the top. THE EFFECT An ordinary back-alley door becomes a monument. The converging verticals (the leaning edges) say "tall, looming, photographed from below," and the viewer feels diminished before it — a door you'd ignore at standing height now has the gravity of a cathedral entrance. THE LESSON Lower the camera and aim up, and you hand the subject power. The same door that was a flat study of color in Chapter 7 is, from the dirt, a thing that towers.
text FIGURE 9.3 — "The red door, bird's-eye" [constructed teaching example] THE FRAME Now the camera looks DOWN at the door — imagine it has come off its hinges and lies flat on the alley floor, or that we have climbed a ladder and lean out over it. We see the whole rectangle nearly flat: the grain of the wood, the handle lying to one side, a scatter of grit and a fallen leaf on the painted surface, the threshold and a strip of cobblestone framing it. THE LIGHT The overhead, near-shadowless light of an overcast sky falls evenly across the flat plane. With the texture no longer raked, the door reads as pattern and color rather than as relief. THE MOMENT Static and still — the bird's-eye view drains away the sense of an entrance you could walk through and replaces it with the sense of an object laid out for inspection. THE CHOICES Camera held high and aimed straight down, sensor parallel to the door so its rectangle stays square in the frame; the fallen leaf placed deliberately off-center on a thirds line; tight enough that only a thin border of ground remains. THE EFFECT The door stops being architecture and becomes a flat-lay — a graphic field of weathered red with one small accent (the leaf) for the eye to land on. It feels mapped, surveyed, almost abstract; the looming power of Figure 9.2 is completely gone. THE LESSON Raise the camera and aim down, and the same subject flattens into design. Height alone moved this door from "monument" to "pattern" without a single change to the door itself.Two photographs, one door, opposite meanings — and the only variable was how high the lens sat. Keep both in mind; the worm's-eye version is your assigned anchor shot, and you will make your own this chapter.
🔄 Check Your Eye: 1. You photograph a friend's dog. From standing height it looks like every other dog photo. Name two different camera heights you could try and what each would say about the dog. 2. A flat-lay of breakfast and a heroic low shot of a skyscraper share the same underlying principle. What is it?
Answers
- (a) Worm's-eye — down at the dog's own eye level or below, aiming slightly up, makes the dog the monumental hero of its world and meets it on its terms. (b) Bird's-eye — straight down on the dog curled on the floor flattens it into a tender graphic shape. Either beats the default standing shot.
- Both use camera height to recruit the viewer's body-knowledge: looking up at the skyscraper makes it powerful; looking down at the breakfast makes it a flat pattern to read. In both, height — not the subject — sets the meaning.
9.2 Distance and focal length: compression versus expansion
This is the section that will explain something you have seen ten thousand times without having words for it: why some photographs stack up — distant mountains looming huge right behind a person — while others open out — a small room feeling cavernous, a near object ballooning toward you. Beginners almost universally credit this to "the lens." It is not the lens. It is where you stand, and untangling that confusion is one of the most useful things this book can do for your eye.
Let's define the key word. Perspective, in photography, is the apparent size and spatial relationship of objects in a scene — how big the foreground looks relative to the background, and how near and far things seem to stack or separate. And here is the fact that almost no beginner believes at first: perspective is determined entirely by your distance from the subject. Focal length does not change perspective at all.
Read that again, because it is the crux of the section. The only thing that changes perspective — the relative sizes of near and far — is moving your feet. Zooming does not change perspective; it only changes how much of the already-established perspective you crop into the frame. A telephoto lens does not "compress" a scene because of anything optical; it compresses because telephoto shots are taken from far away, and it is the distance that does the compressing.
Walk through why. Imagine two posts, one ten feet in front of you and one twenty feet in front of you — the far post is twice as distant as the near one. Standing here, the near post looks much bigger than the far post, because it is half the distance. Now back up until you are a hundred feet from the near post and a hundred-and-ten from the far one. Now the far post is only ten percent farther than the near one, so the two look almost the same size. You changed nothing about the posts and nothing about any lens — you just moved back, and the relative sizes collapsed together. That collapsing-together of near and far is compression: the apparent flattening and stacking of depth that happens when you photograph from far away. The opposite — photographing from very close, so the near thing looms enormous over a tiny background — is expansion: the exaggeration of depth that happens up close.
FIGURE 9.4 — Why distance, not the lens, makes compression (top-down view)
CLOSE (expansion): FAR (compression):
[CAM]──► (N)········(F) [CAM]─────────────────► (N)·(F)
near far near far
| big gap in apparent size | | tiny gap in apparent size |
(N) is MUCH closer than (F) in (N) and (F) are nearly the same
relative terms → (N) looms huge, distance away in relative terms →
(F) shrinks → DEPTH EXAGGERATED. they stack up the same size →
DEPTH FLATTENED.
( N ) = near object ( F ) = far object [CAM] = camera
The lens never entered this diagram. Only the camera's DISTANCE changed.
So where does focal length come in? Focal length decides how much you crop. When you stand far back to get compression, the subject is now small in your view — so you reach for a long (telephoto) lens, or zoom in, to crop tightly onto it and throw away the empty surroundings. When you get close for expansion, a huge near object would overflow any normal frame — so you reach for a short (wide-angle) lens to fit it all in. That is the entire reason "telephoto = compression" and "wide-angle = expansion" became shorthand: the distances those lenses push you toward are what create the effect. The lens just frames the result.
🚪 Threshold Concept: Perspective comes from your feet; focal length only crops. Internalize this and a whole category of confusion dissolves. You stop believing a lens has a magical "look" and start seeing that the look comes from the distance the lens made you shoot from. The practical payoff is enormous: it means you can get the telephoto "stacked" feeling by backing up with any lens you own, phone included, and crop in later — and you can get the wide-angle "expanded" feeling by getting close, not by buying glass. Where you stand is the creative decision; the lens is bookkeeping.
How to shoot compression. Get far away from your subject and use the longest lens or zoom you have (or shoot wide and crop hard afterward). Find a subject with strong layers receding into the distance — a row of fence posts, a street of parked cars, a runner with hills behind, telephone poles marching to the horizon — and back up until those layers visibly stack. The classic compression shot is the one where a setting sun or a mountain looks enormous directly behind a small human figure; that giant sun is not a trick of the lens, it is the photographer standing a quarter-mile back with a long telephoto, so the human and the distant sun sit at nearly the same relative distance and the sun no longer shrinks.
How to shoot expansion. Get close — closer than feels right — to a foreground object, with a wide lens or your phone's main (or ultra-wide) camera, and let that near object dominate while the background falls away small. A bicycle's front wheel thrust toward the lens with the whole street tiny behind it; a hand reaching into the frame; a room photographed from a corner so it yawns open. Expansion exaggerates depth and gives a sense of being there, inside the space.
🎒 Gear Note: This is the one place a phone is genuinely limited, and it's worth being honest about. Phone "zoom" beyond the main lens is usually digital — it just crops and enlarges pixels — so you cannot truly back up a quarter-mile and frame tightly with optical quality the way a long telephoto does. Two real workarounds: (1) For compression, shoot from far back at full resolution and crop in editing; you lose some pixels but keep the perspective, which is the part that matters. A phone with a dedicated 3×–5× telephoto camera does this honestly. (2) For expansion, your phone is excellent — its main and ultra-wide lenses are wide-angle, so getting close to a foreground object gives strong, real expansion with no compromise. The principle survives the hardware: move to set the perspective, use whatever glass crops it.
🔬 The Physics: (Optional — skip without penalty.) The reason perspective depends only on distance is pure geometry, the same similar-triangles logic behind the inverse-square law you met in Chapter 3. The apparent (angular) size of an object is, very nearly, its real size divided by its distance from the lens. So the ratio of apparent sizes of two objects equals the inverse ratio of their distances — and that ratio is fixed the instant you choose where to stand, before any lens is involved. Doubling the focal length doubles the apparent size of everything in the frame equally; it magnifies near and far by the same factor, so their ratio — which is what perspective means — is untouched. The lens is a uniform magnifier; only your position changes the relationship. (There is a subtle real effect called lens breathing and there are genuine wide-angle projection distortions at the frame edges — see §9.5 — but the core size-relationship truth is governed by distance alone.)
Let's make compression concrete with a frame you could go shoot this evening.
🖼️ Read This Frame: Compression at the end of a street.
text FIGURE 9.5 — "The stacked street" [constructed teaching example] THE FRAME Looking straight down a long, narrow city street at dusk. A row of traffic lights, then shop signs, then a distant tower, all appear stacked nearly on top of one another, filling the frame top to bottom with layered detail and almost no visible "depth" — the street reads like a flat collage of signs even though it runs for half a mile. THE LIGHT Even, blue-hour ambient with warm pools under each sign and streetlight; the warm/cool contrast (Chapter 7) separates the layers that compression has squeezed together. THE MOMENT A bus, far down the street, sits between two crosswalks — small in real space but, thanks to compression, nearly the same apparent size as the nearer cars, adding to the stacked feel. THE CHOICES Photographer standing a long block back, longest telephoto available (or maximum zoom, or a wide shot cropped hard later); camera at eye level, perfectly square to the street so the layers stack cleanly; foreground deliberately excluded so the eye reads pure depth-collapse. THE EFFECT The half-mile of street is compressed into a dense, graphic stack — a sense of a crowded, layered city that an up-close wide shot of the same street could never give. The distance did it; the long lens only cropped to it. THE LESSON To stack a scene, back up and shoot long. Compression is a decision about *distance* — you make it with your feet and merely execute it with focal length.📸 In the Field — The move-and-zoom proof. Go to the busy intersection or market (one of our four recurring locations) and pick a subject with something clearly behind it — a person at a stall with a building behind, a sign with the street beyond. Make two photographs of the same subject at the same size in the frame: first, get close and use your widest lens (or main phone camera up close); second, back up as far as you can and zoom or crop until the subject is again the same size in the frame. Shoot 20 variations if you like; keep the two cleanest. Lay them side by side. The subject is identical in size — but in the close one the background is small and far; in the far one it has swollen up huge right behind the subject. You just saw, in your own two frames, that distance — not the lens — controls perspective. That comparison will teach you more about lenses than any spec sheet.
9.3 Angle and the relationship between subject and background
Every subject sits in front of something, and that something is never neutral. The background is part of the photograph whether you chose it or not, and one of the most common reasons an otherwise-good frame fails is that the photographer was so focused on the subject that they never looked at what was behind it — so a lamppost appears to sprout from a friend's head, or a chaotic shelf of clutter fights the portrait for attention. The cure is not to move the subject. It is to move yourself, changing the angle between you, the subject, and the background until the relationship between them works.
The principle: for any fixed subject, a small change in your position can completely change its background, because the background is whatever lines up behind the subject from where you stand. Take one step left and the lamppost slides off your friend's head. Crouch and the cluttered shelf drops below the frame, replaced by a clean wall. Rise up and the busy street behind becomes a patch of quiet pavement. The subject did not move an inch; you changed which slice of the world fell behind it.
This gives you a repertoire of moves, each of which trades one background for another:
- Step left or right to slide a distracting object out from behind the subject, or to bring a helpful background element (a pool of light, a contrasting color, a leading line from Chapter 6) into alignment behind it.
- Crouch (lower the camera) to replace a cluttered ground-level background — cars, crowds, signage — with sky, ceiling, or the clean upper part of a wall. This is the §9.1 worm's-eye move doing double duty.
- Rise (raise the camera) to replace a busy mid-level background with the ground — grass, sand, pavement, a rug — turning chaos into a simple field of texture behind the subject.
- Circle the subject until the best available background is behind it. The light on the subject may also change as you circle (Chapter 5), so you are often hunting for the spot where good light on the subject and a clean background behind it coincide.
- Get closer or change focal length to throw the background out of focus (the depth-of-field control from Chapter 4) — but note that blurring a background is a different fix from changing it, and the strongest images often do both: choose a good background, then soften it.
FIGURE 9.6 — One subject, three angles, three backgrounds (top-down view)
[ messy shelf ]
│
(clean wall)────┼──── (bright window)
│
( S ) ◄── subject stands still
╱ │ ╲
╱ │ ╲
[CAM-A] ╱ [CAM-B] ╲ [CAM-C]
left center right
CAM-B (the default, straight on): subject is in front of the MESSY SHELF — distracting.
CAM-A (step left): now the CLEAN WALL falls behind the subject — calm, simple.
CAM-C (step right): now the BRIGHT WINDOW is behind — a blown-out distraction; worse.
The subject ( S ) never moved. Three small sidesteps put three different
backgrounds behind it. Choosing the angle IS choosing the background.
The diagram carries the whole idea: you do not get to move the shelf or the window, but you absolutely get to decide which one ends up behind your subject, simply by where you place your feet. And notice that not every move is an improvement — stepping right put a blown-out window behind the subject, which is worse than the shelf. Working the angle (the subject of the next section) means trying them and choosing, not just moving for movement's sake.
⚠️ Common Mistake: The merger — when subject and background fuse because the photographer never checked what was behind. The pole growing from the head, the tree branch slicing through the neck, the horizon line cutting across the eyes, the bright window swallowing the subject's edge. The fix is a habit: before you press the shutter, deliberately look past your subject at the background, all the way to the edges of the frame. Then take the one step that fixes it. This single check, done every time, will raise your keeper rate more than any gear.
💡 Why It Works: Changing your angle works because a photograph is two-dimensional — it collapses the real, three-dimensional separation between a subject and the things behind it onto one flat plane. In life your eyes and your motion tell you the lamppost is twenty feet behind your friend; in a flat photograph that depth information is gone, so the lamppost and the head simply overlap, and the eye reads them as touching. You can't restore the depth, but you can choose an angle where nothing unfortunate overlaps — which is why a problem invisible to you in the moment leaps out of the final image, and why the cure is always a change of viewpoint.
🔗 Connection: This section is the practical partner of Chapter 6's negative space and Chapter 4's depth of field. Chapter 6 taught you to want a clean, simple background; Chapter 4 taught you to blur a background with a wide aperture. This chapter adds the third and most basic tool: move so a better background is there in the first place. The professional habit is to use all three together — pick the angle, then frame for simplicity, then soften with focus.
Here is the move made concrete in our kitchen-window location.
🖼️ Read This Frame: Two steps, two portraits.
text FIGURE 9.7 — "The sidestep portrait" [constructed teaching example] THE FRAME A friend sits at the kitchen table in soft window light. VERSION A (the snapshot): shot straight on from where the photographer was standing, a cluttered counter — dish rack, bottles, a calendar — fills the wall behind them, every object competing with the face. VERSION B (the fix): the photographer took two steps to the left and crouched slightly; now a plain stretch of wall, with one soft rectangle of window light on it, sits behind the subject, and nothing competes. THE LIGHT Identical in both — soft, directional window light from camera-left (the Chapter 5 / Chapter 13 staple). The light on the face did not change; only the background behind it did. THE MOMENT The same relaxed half-smile in both; this is not about timing but about viewpoint. THE CHOICES Version B: two steps left, slight crouch, same focal length, same exposure. The only variable is the photographer's position relative to subject and background. THE EFFECT Version A's eye ricochets around the clutter and never settles; Version B's eye goes straight to the face and stays, because the simplified background gives it nowhere else to go. The portrait that looked impossible in a messy kitchen was two sidesteps away the whole time. THE LESSON You rarely need to tidy the room or move the subject. You need to move *yourself* until a clean background falls behind them. Angle is background.🔄 Check Your Eye: 1. A portrait has a streetlight apparently growing out of the subject's head. List three different moves that would each fix it, and what new background each would create. 2. Why does a background problem that you didn't notice while shooting jump out at you in the final flat photograph?
Answers
- (a) Step left or right — slides the streetlight off the head; new background is whatever is now behind (another part of the street). (b) Crouch and aim up — drops the streetlight out of frame and puts sky behind the head. (c) Rise / get the subject to move forward and back up so the streetlight falls far out of focus — same background, but blurred. (Reframing or changing focal length also count.)
- Because the photograph is flat: it collapses the real depth between the head and the distant streetlight onto one plane, so the two overlap and read as touching. Your eyes saw the depth in the moment; the photo throws it away.
9.4 Moving to subtract: working the scene
Here is a habit that separates people who take photographs from people who make them, and it has nothing to do with equipment. Amateurs walk up to a subject, raise the camera from wherever they happen to be standing, make one frame, and move on. Professionals do something that looks almost obsessive: they circle the subject, crouch and rise, step in and step back, try it vertical and horizontal, watch the background shift and the light change, and shoot dozens of frames from a dozen positions — and only then, in the edit, choose the one that works. This practice has a name. It is called working the scene: deliberately exploring a subject from many vantages, distances, and framings rather than settling for the first acceptable shot.
Working the scene is not about volume for its own sake — it is not "spray and pray." It is directed exploration, and its governing instinct is subtraction. As you move, you are constantly asking: what can I get rid of? What can I push out of the frame by stepping sideways, crouching, or moving in? A great frame is usually not the one with the most in it but the one with the least — everything inessential subtracted until only the picture remains. You met this idea in Chapter 6 as negative space and in Chapter 1's case study, where Dorothea Lange moved physically closer with each exposure until a roadside encounter became an icon. Working the scene is how you find that subtraction with your feet.
A simple, repeatable routine for working any scene:
- Make the obvious shot first, then promise yourself it's not the keeper. Get the safe frame from where you stand so you have something — then treat it as the worst frame you'll make today and start working.
- Change height. Crouch low; rise high; try eye-level last. (This is §9.1, applied.) Each height is a different photograph.
- Change distance. Step in until it feels too close — Capa's whole point in the epigraph — then step back for the wider relationship. Closeness is almost always under-used by beginners.
- Circle. Walk around the subject watching two things: where the light falls on it (Chapter 5) and what background lines up behind it (§9.3). Stop where they coincide.
- Switch orientation. Make it both vertical and horizontal; the right frame shape is a decision, not a default, and most beginners shoot horizontal out of habit even when the subject is vertical.
- Subtract, subtract, subtract. With every move ask "what can leave the frame?" Step until the answer is "nothing else — this is only what matters."
FIGURE 9.8 — Working a single subject (top-down map of positions)
☀ (light)
\
[4] \ [2] ← crouch here: clean wall behind,
(far, wide \ ╱ light rakes across the subject
relationship) ( S )
╱ │ ╲
╱ │ ╲
[1] │ [3] ← step in CLOSE; background drops away
(the safe, [5] and the subject fills the frame
obvious (low, aiming
first shot) up — heroic)
( S ) = subject [n] = a camera position you try, in order ☀ = the light
One subject, six+ positions. The keeper is almost never [1], the shot you'd
have stopped at. It is usually [2] or [3] — found only because you kept moving.
The map makes the discipline visible: position [1] is the lazy default that most people would have called "the photo." Positions [2] through [5] are where the actual photograph lives, and you only reach them by refusing to stop at [1]. The keeper is earned by movement.
How to do it without exhausting yourself. Working the scene sounds like a lot, and at first it is — but it becomes fast. A useful constraint: give yourself a number. "I will not leave this subject until I have made ten genuinely different frames" — different positions, not ten near-copies from one spot. The constraint forces you past the obvious. On a phone this is effortless and free; on any camera it costs only the willingness to look slightly odd crouching in public. The self-review that turns it into learning: shoot many, keep three, and be able to say why those three beat the rest. That sentence — "this one wins because…" — is where the eye actually develops.
📸 In the Field — Ten frames, one subject. Return to the park or trail at the edge of town and choose one subject that catches you: a single tree, a bench, a boulder, a gate. Do not leave it until you have made ten genuinely different photographs — different heights, distances, angles, and orientations, not ten versions of one view. Make the obvious shot first and then deliberately abandon it. At home, cull to the three strongest and write one sentence on each saying why it beats the obvious first frame. You will almost always find that your favorite is one you would never have made if you'd stopped after the first. This is the core working-the-scene exercise, and it feeds the Portfolio.
⚠️ Common Mistake: Shooting one frame and leaving — the "I got it" reflex. You did not get it; you got a version of it, almost always the most obvious one. The cost of one more minute of working the scene is nearly zero (especially on a phone); the cost of walking away with only the safe shot is a lifetime of merely-fine photographs. The opposite error also exists — chimping, obsessively reviewing the back of the camera after every frame instead of staying in the flow of working — so the balance is: keep moving and shooting, review in batches, and do your real choosing later, in the edit.
🎞️ Behind the Image: (A constructed but representative vignette.) A photographer set out to make one picture of a lighthouse and made the obvious postcard from the parking lot in the first thirty seconds. It was fine. Instead of driving off, they spent forty minutes working it — climbing the dune for a high angle, lying in the marram grass to shoot up through the wildflowers with the tower behind, circling until the late sun rimmed one edge, stepping in until a single weathered railing filled the foreground. The keeper, the one that later sold as a print, was the low shot through the grass — a frame that did not exist, could not have existed, from the parking lot. The postcard took thirty seconds. The photograph took forty minutes of refusing to leave.
9.5 Perspective distortion: when to use it and when to correct it
Get very close to a subject with a wide-angle lens and something strange and powerful happens: the parts of the subject nearest the lens balloon outsized, the parts farther away shrink, and straight lines near the edges of the frame may bow or lean. This is wide-angle distortion — the exaggeration of near-far size differences and the bending or stretching of forms that occurs when a wide lens is used close to a subject. It is not a flaw in the lens (though cheap lenses add their own optical errors on top); it is the honest geometric consequence of §9.2's expansion, pushed to an extreme. The question is never "is distortion bad?" It is "is this distortion saying what I want?"
A close cousin worth naming on its own: foreshortening — the visual compression of an object or limb along the axis pointing toward the camera, so that a thing extending toward the lens looks shorter and bulkier than it is. Point a hand straight at the camera and the arm seems to vanish behind a giant palm; the arm is foreshortened. Lie down and shoot up a person's body from their feet and the feet are huge, the head tiny and far — extreme foreshortening. Foreshortening is what makes near-the-lens body parts loom, and it is the reason close wide-angle portraits can look comic or grotesque: the nose, nearest the lens, enlarges while the ears recede.
So when do you use it?
- For drama and immersion. A wide lens close to a foreground rock with the landscape sweeping away behind it pulls the viewer into the scene — the exaggerated depth says "you are standing right here." This is the native look of dynamic landscape and architecture.
- For energy and exaggeration. A skateboarder shot from inches away with a wide lens, board thrust at the camera, becomes all motion and attitude — the distortion is the style.
- For scale. Exaggerated near-far difference can make a small foreground object monumental against a diminished background, or make an interior feel vast.
And when do you correct or avoid it?
- In portraits, almost always. Foreshortening enlarges whatever is closest — usually the nose — and most people do not want a portrait that enlarges their nose and shrinks their ears. The fix is not a better lens; it is distance. Back up and use a longer focal length (an 85mm-equivalent is the classic "flattering" portrait length) so the face is photographed from far enough away that near and far features sit at nearly equal distances and the proportions read true. This is §9.2 again: the flattering portrait "look" is really a distance, executed with a longer lens.
- In architecture and product, when accuracy matters. Converging verticals (a building's sides leaning together) and bowed lines can be exaggerated by close wide shots. Sometimes you want that drama; when you want the building to look straight and true, you correct it — by backing up, by keeping the camera level (sensor parallel to the subject), or in editing. (The full toolkit for fixing converging verticals and keystoning is the subject of Chapter 18.)
FIGURE 9.9 — Foreshortening: the same person, two distances (side view)
CLOSE + WIDE (foreshortened): FAR + LONG (true proportion):
nose ●──[CAM] [CAM]─────────────► ● face
(huge) ▲ (flattering portrait distance)
│ ears, far → tiny
╲ │ ╱ near and far features are nearly
(face leans/balloons) equidistant → face reads NATURAL
Near features (nose) loom; far features Backing up and zooming in restores
(ears) recede → caricature. true proportions. Distance is the fix.
The diagram restates the chapter's spine one more time: the difference between a flattering portrait and a caricature is not the lens — it's the distance, with the lens chosen to crop the result. A wide lens used far away introduces no foreshortening; a long lens used close (if you could) would. The distortion lives in the distance.
♿ Accessibility & Inclusion: Wide-angle distortion is not neutral when your subject is a person — it physically reshapes their face and body, and an unflattering distortion can read as unkind, especially shot without the subject's awareness. Two habits matter. First, photograph people at a respectful distance with a normal-to-long focal length unless they've agreed to a stylized close-wide look — it renders bodies of every size and shape with their true proportions. Second, when you write alt text or a Described Photograph of a person, describe them respectfully and accurately, and note the vantage ("shot from below, which lends the figure a monumental presence") so a non-sighted reader understands not just who is pictured but how the photograph positions them. The vantage you choose is an ethical choice, not only an aesthetic one — a theme we return to in Chapters 17 and 32.
🔗 Connection: The correction side of this section — fixing converging verticals, keystoning, and bowed lines so buildings stand straight — is a whole craft of its own, and we give it Chapter 18 ("Architecture & Real Estate"), which introduces perspective correction and tilt-shift technique. Here we only need you to see the distortion and decide, shot by shot, whether it serves you. There it becomes a set of tools.
🔄 Check Your Eye: 1. A friend dislikes a close-up portrait you made — they say their nose "looks huge." What caused it, and what is the fix (and what is not the fix)? 2. You want a dramatic, immersive landscape where a foreground flower looms against distant mountains. Close-and-wide, or far-and-long? Why?
Answers
- Foreshortening from shooting too close enlarged the nearest feature (the nose) and shrank the farthest (the ears). The fix is distance: back up and use a longer focal length so the face is shot from far enough that features sit at near-equal distances. The fix is not a "better" or sharper lens — the lens didn't cause it; the close distance did.
- Close-and-wide (expansion). Getting close makes the near flower loom enormous while the distant mountains stay small-but-present, exaggerating depth and pulling the viewer into the scene — exactly the immersive feel you want. Far-and-long would flatten the flower and mountains into the same plane.
9.6 Point of view as voice
We have spent this chapter on physical position — height, distance, angle, the discipline of moving. Now lift it one level, because all of that physical choosing adds up to something larger and more lasting. Your repeated choices about where to stand become a signature. We call it your point of view: not merely the literal spot the camera occupied, but the consistent stance toward your subjects — the recurring way of seeing — that, across a whole body of work, makes your photographs recognizably yours.
The phrase has two senses, and both matter. The literal sense is the one we've been building: point of view is the camera's physical position, the vantage from which a particular photograph was made. The deeper sense is metaphorical: point of view is your attitude, your way of looking at the world, expressed through the accumulation of all those physical choices. A photographer who habitually crouches to children's level, who gets close, who waits for the unguarded gesture, is expressing a point of view — I meet people as equals, I get intimate, I value the honest moment — as surely with their feet and their timing as a writer expresses one with their word choice. Over a hundred photographs, the physical pattern becomes the attitude, and the attitude is what we mean when we say a photographer has a voice.
This is why two photographers at the same event come home with different work even with identical cameras. One shoots from the edges, wide and observational, finding the small human moments in the corners; the other wades into the center, close and bold, filling the frame with faces. Neither is right. They have different points of view, and the point of view is the photographer. The camera recorded; the stance chose.
🚪 Threshold Concept: Where you stand is who you are, photographically. The thousand small decisions about height, distance, and angle are not just technique — accumulated, they are your voice. This is the bridge from craft to art: you stop asking only "how do I make this one frame good?" and start asking "what do I keep choosing, and what does that say about how I see?" You cannot fake a point of view and you cannot buy one. It emerges from making the choices in this chapter, consciously, thousands of times, until they become yours. We give this its own full treatment in Chapter 39, where voice and style become the subject — but it starts here, with the realization that your feet have an opinion.
How to develop a point of view (you don't choose it, you discover it). You do not sit down and decide "my point of view will be low and close." You find it by shooting a great deal, then looking back at your own work for the patterns you didn't know you had. The practice:
- Shoot widely and then audit. After working many scenes (§9.4), lay out your keepers and ask: what do I keep doing? Do I gravitate to high angles or low? Close or far? Centered or off-balance? Tidy or chaotic? The pattern is the seed of a voice.
- Then lean in deliberately. Once you notice a tendency you like, exaggerate it on purpose for a while. If you find you love low-and-close, shoot a whole project that way. Constraint sharpens voice faster than freedom.
- And test it against its opposite. Periodically shoot the way you don't — if you're always low, spend a day only shooting high. You'll either confirm your voice or expand it. Both are wins.
🔗 Connection: Point of view as voice is the through-line that connects this chapter to the very end of the book. The Light Log (begun in Chapter 5) and the working-the-scene habit (§9.4) feed it; Chapter 31 teaches you to read point of view in others' work through critique; Chapter 39 makes "style, voice, and vision" its entire subject; and the final Portfolio (Chapter 40) is, in the end, the document of your point of view, made visible across twenty to thirty images. Everything you do with your feet this chapter is depositing into that account.
Let's close the chapter where it began — with the red door — and watch point of view, not the door, become the subject.
🖼️ Read This Frame: Two photographers, one door, two voices.
text FIGURE 9.10 — "The red door, two points of view" [constructed teaching example] THE FRAME PHOTOGRAPHER A makes the worm's-eye monument from Figure 9.2 — low, aiming up, the door towering, sky capping the frame, a person's worshipful smallness implied. PHOTOGRAPHER B, same door, same hour, instead steps far back across the alley, shoots long, and catches a neighbor passing the door mid-stride, the compressed background stacking the alley's textures behind them — the door now a backdrop to a human moment. THE LIGHT The same raking afternoon side light for both; the light is a constant, the *stance* is the variable. THE MOMENT A's moment is the light on the static door; B's is the half-second of the passing stride — two different relationships to time, both valid. THE CHOICES A: low, close, wide, no people — a point of view that *monumentalizes objects and texture*. B: far, long, waiting for a person — a point of view that *finds the human moment in a place*. THE EFFECT A's photograph is about the door as a thing of weight and surface; B's is about the alley as a place where life passes a red door. Neither is more correct. They are two voices speaking about the same three square feet of the world. THE LESSON The subject was identical; the *point of view* — physical and attitudinal — made two different photographs and revealed two different photographers. The door never changed. The seeing did. This is the whole book in one red door.📸 In the Field — Your point-of-view audit. Open your camera roll or Portfolio folder and look at your last forty photographs as a set, not as singles. Tally: how many from standing eye level? How many low? How many high? Close or far? What backgrounds do you favor? Write three sentences describing the point of view your own work already has — the patterns you didn't plan. Then make a single new photograph that deliberately breaks that pattern (if you're always eye-level and far, go low and close). You are doing two things at once: discovering your voice and stretching it. Keep the breaking-frame; it might be the start of something.
Portfolio Checkpoint
Your Portfolio so far shows you can expose, see light, compose with thirds and lines, use color, and see in black and white. This chapter adds the proof that you can change a subject's meaning by where you stand — the most under-used control there is.
The assignment (a keeper): Photograph one subject from an unexpected vantage that changes its meaning. Pick any subject — an ordinary one is best, in the red-door spirit — and work it (§9.4): make the obvious eye-level frame first, then abandon it and explore height, distance, and angle until you find a vantage that makes the subject say something it doesn't say from the default view. The keeper is the frame where the vantage itself is the idea: the kitchen chair that becomes a monument from the floor; the puddle that becomes a sky from above; the crowd that becomes a pattern from the balcony; your own red door looked up at like a cathedral. Shoot many; keep one.
Why this image belongs: Every other keeper in your Portfolio so far could, in principle, have been made from standing eye level. This one cannot — its whole reason for existing is the vantage. It demonstrates that you understand the chapter's thesis not as a fact but as a tool: that perspective and point of view are controls you reach for on purpose. A portfolio reviewer (and Chapter 40's curation pass) reads an unexpected-but-earned vantage as a sign of a seeing photographer, not a pointing one.
Curation note: Place this image beside your Chapter 6 composition keeper and your Chapter 8 black-and- white frame and notice the progression — you are accumulating images that each demonstrate a different deliberate control, which is exactly what range means. In your running list, record the subject, the vantage you chose, and one sentence on what the vantage changed about its meaning. That sentence is the real deliverable; the image is its evidence.
Summary
This chapter put your feet back in your hands. The controls here cost nothing, work on any camera, and change a photograph's meaning more reliably than almost anything you can buy.
- Camera height is the first vantage decision. Worm's-eye (low, aiming up) makes a subject loom — powerful, heroic, monumental — and often swaps clutter for sky. Bird's-eye (high, aiming down) flattens and maps a subject — small, graphic, surveyed — and is the native language of the flat-lay. Eye level meets the subject as an equal and is the honest default — but a terrible only choice. Rule: for any subject worth a photograph, make at least one frame from a height that is not your standing eye level.
- Perspective comes from distance, not focal length. Perspective — the relative sizes and stacking of near and far — is set entirely by where you stand. Compression (the stacking, flattening look) comes from shooting far away; you reach for a telephoto only to crop into it. Expansion (the looming, opened- out look) comes from shooting close; you reach for a wide lens only to fit it in. Move to set the perspective; use the lens to frame it.
- Angle is background. For any fixed subject, a small change in your position changes what falls behind it. Step left/right to slide distractions away; crouch to swap clutter for sky; rise to swap it for ground; circle until good light and a clean background coincide. Always look past your subject to the frame edges before you shoot, and take the one step that fixes the merger.
- Work the scene; move to subtract. Never shoot one frame and leave. Make the safe shot, abandon it, then change height, distance, angle, and orientation — subtracting clutter with every move — until only the picture remains. Shoot many, keep three, say why. The keeper is almost never the obvious first frame.
- Wide-angle distortion and foreshortening are tools, not flaws. Close-and-wide exaggerates depth and enlarges near features — magnificent for immersive landscapes and energy, unflattering for faces. The fix for an unflattering portrait is distance (back up, shoot longer), never a "better" lens. Decide shot by shot whether the distortion serves the picture.
- Point of view is voice. Your accumulated choices about where to stand become a recognizable stance toward your subjects. You don't choose a point of view; you discover it by shooting widely and auditing your own patterns, then leaning in. Where you stand is, over a body of work, who you are photographically.
| Vantage | Camera position | Subject reads as | Best for | Watch out for |
|---|---|---|---|---|
| Worm's-eye | Low, aiming up | Large, powerful, heroic | Monuments, kids, drama, clean-sky backgrounds | Converging verticals (unless wanted) |
| Eye level | Level with subject | Natural, equal, direct | Honest portraits, documentary | Becoming your only default |
| Bird's-eye | High, aiming down | Small, mapped, graphic | Flat-lays, patterns, crowds, tenderness | Flattening away wanted depth |
| Close + wide | Near, short lens | Immersive, exaggerated depth | Dynamic landscape, energy, scale | Foreshortening faces unflatteringly |
| Far + long | Distant, long lens | Compressed, stacked, flat | Stacked layers, giant-sun shots, flattering portraits | Losing the sense of being there |
Spaced Review
Test yourself on earlier chapters without scrolling back — retrieval is what moves knowledge into your hands.
- (Chapter 6.) Name three of composition's core tools for directing the viewer's eye, and say in one line what each does.
- (Chapter 4.) What does a wider aperture (a smaller f-number) do to depth of field, and how would you use that to simplify a busy background — and how is that different from changing the background by moving (this chapter, §9.3)?
- (Chapter 6.) What is negative space, and how does the working-the-scene habit in §9.4 help you find it?
- (Chapter 4.) If you wanted both the near foreground flower and the distant mountains sharp in an expansion shot, which way would you move the aperture, and what is the trade-off?
Answers
1. Any three of: *rule of thirds* (placing key elements off-center on the third-lines/intersections for balance and energy); *leading lines* (lines that draw the eye toward the subject); *visual weight / balance* (distributing elements so the frame doesn't tip); *framing* (using foreground elements to surround the subject); *negative space* (empty area that isolates and emphasizes the subject). 2. A wider aperture (smaller f-number) gives *shallower* depth of field, blurring the background to push it out of competition with the subject. That *blurs* a background; §9.3's move *changes which background is there at all* — the strongest images do both: choose a clean background by angle, then soften it with aperture. 3. *Negative space* is the empty, uncluttered area around a subject that isolates and emphasizes it; working the scene helps you find it because moving (crouching, stepping, backing up) lets you subtract clutter and place the subject against simple, open areas. 4. Move the aperture to a *higher* f-number (smaller opening, e.g. f/11–f/16) for *deeper* depth of field so both near and far are sharp; the trade-off is less light reaching the sensor, requiring a slower shutter or higher ISO to compensate.What's Next
You now command the two great spatial controls — where you stand (this chapter) and what you include (composition, Chapter 6). One control is still missing, and it is the one that turns a photograph of a place into a photograph of life: time. In Chapter 10 we take on the decisive moment — the art of when to press the shutter. We'll study how Henri Cartier-Bresson built a whole philosophy around the split second when form and content align, learn to anticipate a scene before it peaks rather than react after it's gone, and discover that in photographs of people and motion, timing is not a detail of the picture — it is the picture. You have learned to choose your position; next you learn to choose your instant.