Case Study 27-02: The Chess Experiments — How Experts See Differently
A Puzzle About Memory
In the mid-1940s, Dutch psychologist Adriaan de Groot set out to understand how chess grandmasters think. His initial question was straightforward: do the best chess players simply calculate more moves ahead than weaker players?
The answer he got was not straightforward at all. And it changed cognitive science.
De Groot's method was to show chess positions to players of varying skill levels — grandmasters, masters, club players, and beginners — and ask them to find the best move while thinking aloud. He recorded their verbal protocols in meticulous detail. What he expected to find was that grandmasters calculated further, considering more possibilities.
What he actually found: grandmasters did not calculate more deeply on average. They considered fewer candidate moves, not more. But almost every move they considered was a good one. Weaker players considered more moves — including many bad ones — before arriving at their choice. The grandmaster's search was narrower, faster, and more accurate.
This suggested something important: the grandmaster wasn't computing better. The grandmaster was seeing better. The first move they noticed was usually the right move, because their perception of the board had been organized by years of pattern recognition into something that quickly surfaced the relevant possibilities.
De Groot's secondary experiment confirmed this. He showed positions to players of different skill levels for five seconds, then asked them to reconstruct the position from memory. The results were startling: grandmasters reconstructed the positions almost perfectly. Beginners remembered a handful of pieces.
But the crucial follow-up — the random position test — is where the science got decisive.
Chase and Simon's Definitive Study
In 1973, William Chase and Herbert Simon at Carnegie Mellon University replicated and extended de Groot's work with greater methodological rigor. Their paper, "Perception in Chess," published in Cognitive Psychology, became one of the most cited papers in cognitive science.
Chase and Simon's central innovation: they added the random position condition.
When board positions were drawn from real games — positions that had actually occurred or could plausibly occur — the results matched de Groot's findings. Masters (Class A players in their study, slightly below grandmaster level) recalled around 90% of the pieces. Beginners recalled around 30–40%.
When board positions were random — pieces scattered across the board in configurations that would never arise in actual play — the difference collapsed. Masters recalled roughly 30–35% of the pieces. So did beginners. The advantage evaporated.
This result ruled out the "better memory" hypothesis definitively. If masters had superior chess-specific memory because they had better memories in general, the advantage would have persisted with random positions. It didn't.
The explanation Chase and Simon proposed — building on earlier work by cognitive psychologist George Miller — was chunking. Experts encode information in larger, more meaningful units. A random scatter of pieces is just pieces — there are no meaningful units to encode, so everyone is stuck holding pieces in working memory one by one, and working memory's capacity is limited (Miller's famous "seven plus or minus two" chunks).
But a real game position is not pieces. It's a collection of configurations — formations, attack vectors, defensive structures, piece relationships — that have been encountered in various forms thousands of times. The master doesn't see a rook, a bishop, and three pawns. They see a queenside minority attack. That's one chunk, not five pieces. The cognitive economy is dramatic.
The Size and Content of the Expert's Chunk Library
Chase and Simon estimated that expert chess players have approximately 50,000 chunks stored in long-term memory — meaningful configurations of pieces that they have learned to recognize as units. Later researchers, including Simon himself in subsequent work with Fernand Gobet, refined this estimate upward, suggesting the number for grandmasters might be in the range of 100,000 or more.
Each chunk in the library is not just a geometric configuration. It carries with it: - Strategic implications: what this formation tends to mean about the game's direction - Tactical threats: what attacks or defenses this formation enables or demands - Associated history: variants of this configuration the player has seen in their study of master games - Typical response patterns: the categories of moves that tend to work well or poorly against this configuration
This is the architecture of expert perception. When a grandmaster sees a position, they are not scanning 32 pieces in isolation. Their pattern recognition system is firing on configurations — meaningful units that immediately suggest the story of the position and the relevant moves to consider.
De Groot used the metaphor of reading. A beginning reader sees letters, then struggles to decode them into words, then combines words into sentences with effort. An expert reader sees phrases — meaningful chunks — directly, and the comprehension is nearly effortless. The master chess player reads the board as an expert reader reads text. The information arrives organized.
Why Experts Are "Seeing Differently," Not "Seeing More"
A common intuition about expertise is that experts simply hold more information in their heads. The expert doctor knows more diseases; the expert lawyer knows more cases; the expert chess player knows more positions. This intuition is partially right but fundamentally misleading.
The more important difference is perceptual, not informational. Experts have restructured perception itself.
Chase and Simon's analysis of grandmaster move selection revealed a feature that is easy to overlook: grandmasters rarely considered a move they would immediately evaluate as bad. They didn't need to consciously reject bad moves, because their pattern recognition system filtered them before they reached consciousness. The grandmaster's attention was drawn, almost magnetically, to the subset of moves that were worth considering.
This is qualitatively different from what the beginner experiences. The beginner's attention is equally distributed across the board. Every piece is, in some sense, equally salient. They must consciously evaluate many options to find the good ones. Their cognitive effort is spent on a triage task that the expert's pattern recognition handles automatically and nearly instantly.
The implication for luck: the expert doesn't just know more about what constitutes opportunity. They perceive the opportunity-rich areas of a situation more richly, and they perceive the noise as noise. Their perceptual system has been reorganized by expertise so that relevant signals are louder and irrelevant noise is quieter.
Apply this to any domain: the expert investor whose pattern library "lights up" when a business model fits a specific profile; the expert editor whose eye catches the awkward sentence without consciously parsing every word; the expert negotiator who reads the shift in body language before the other party has consciously decided anything. These are not magical gifts. They are the output of a perceptual system reorganized by the accumulated weight of pattern encoding over years.
Chunking Theory Beyond Chess
Chase and Simon's original paper was careful to note that chunking was not a chess-specific phenomenon. They proposed it as a general theory of expertise, applicable across any domain with enough complexity and regularity to support pattern learning.
Subsequent decades of research have validated this view across an impressive range of domains:
Medical diagnosis: Expert radiologists show chunking in X-ray reading. They encode meaningful configurations — abnormal lung patterns, specific bone density profiles, characteristic shadow shapes — rather than raw image information. Studies using eye-tracking technology found that experts look at fewer locations in an X-ray but extract more information from each location. Their gaze moves directly to the diagnostically relevant regions.
Physics problem-solving: Chi, Feltovich, and Glaser (1981) found that expert physicists and novice physics students categorized problems differently. Novices categorized by surface features (problems involving inclined planes, problems with springs). Experts categorized by underlying principles (conservation of energy problems, Newton's second law problems). The experts were chunking at a deeper structural level.
Bridge and other card games: Research on expert bridge players showed the same random-versus-meaningful pattern as chess. Experts recalled dealt hands (meaningful patterns) far better than randomly arranged cards. Their chunk libraries encoded the configurations of cards likely to arise in bridge deals.
Music: Expert musicians reading sheet music chunk multiple notes into melodic or harmonic units. Beginning readers process notes individually. The expert musician sees a diminished seventh chord where the beginner sees four separate noteheads.
Software programming: Shneiderman (1976) found that expert programmers remembered meaningful code segments better than scrambled code segments, while novices showed no such difference. Experts had encoded chunks of syntactically and semantically meaningful code patterns.
The 10-Year Rule and Pattern Library Construction
Chase and Simon's work prompted one of the most durable empirical findings in the psychology of expertise: no one reaches grandmaster-level pattern recognition in chess in fewer than approximately ten years of intensive study. The same threshold holds, with variation, across other complex domains.
This is the 10-year rule — not ten years of playing, but ten years of the kind of intensive, feedback-rich, deliberate practice that allows pattern libraries to be built systematically. The chess player who studies master games, analyzes their own games, solves tactical puzzles, and reviews opening theory is building the chunk library. The chess player who simply plays games against willing opponents is building experience, but not the same kind of chunk library.
The research on deliberate practice by Anders Ericsson and colleagues refined this further: the key variable is not years but hours of deliberate practice, and the quality of that practice. A chess player who spends ten years playing but rarely studying master games will have a much smaller chunk library at the end of a decade than one who spends ten years in systematic study.
For our purposes, the implication is direct: the pattern library that enables lucky insights is not built quickly, and it is not built by mere experience. It is built by deliberate study of the patterns in a domain, combined with feedback that calibrates the library over time.
The Recognition-Primed Decision Connection
Chase and Simon's chunking theory connects directly to Klein's recognition-primed decision-making (discussed in the main chapter). The chunk library is the mechanism that makes recognition-primed decision-making possible.
When the firefighter enters the room and recognizes, in less than a second, that the heating pattern is wrong — that recognition is a chunk-level event. The firefighter's pattern library contains chunks for "normal room fire" and "basement fire burning through sub-floor." The current situation is matching or failing to match those chunks simultaneously, and the mismatch is what generates the feeling of wrongness.
When Fleming looked at the contaminated petri dish and recognized the spatial pattern of diffusion-mediated bacterial death — that recognition was a chunk event. His library contained the chunk for "diffusible antibacterial substance effect." The dish matched the chunk. The match produced the recognition.
The chess grandmaster, the veteran firefighter, the expert bacteriologist, and the behavioral economist who once played professional poker — these are all people who have spent years building chunk libraries through the kind of practice that encodes patterns into long-term memory at a high density.
The lucky insights attributed to each of them are, in the language of Chase and Simon's framework, chunk matches. The environmental trigger — the board position, the contaminated dish, the unusual room, the behavioral anomaly at the poker table — matched a chunk in a prepared library. The match generated recognition. The recognition generated action.
The luck was in the trigger. The prepared mind was what made the trigger matter.
Limitations and Critiques of Chunking Theory
In the interest of intellectual honesty, it's worth noting that chunking theory, while foundational, has been challenged and refined in subsequent decades.
Chunking theory doesn't fully explain sequence. De Groot and Chase and Simon focused on static positions. But chess play involves dynamic sequences — the order of moves matters enormously. Gobet and Simon later proposed "template theory" as an extension of chunking that better accounts for how experts encode dynamic patterns across moves, not just static configurations.
Individual differences are larger than originally acknowledged. While the general finding that experts have larger chunk libraries is robust, the size of the library needed to reach a given performance level varies considerably across individuals. The 50,000 chunk estimate is an average; some grandmasters may achieve grandmaster strength with a smaller chunk library coupled with superior calculation ability.
The role of explicit knowledge. Chess grandmasters don't just have implicit pattern libraries — they have large amounts of explicit, verbally encoded knowledge about openings, endgames, and strategic principles. The relationship between implicit chunking and explicit knowledge is more intertwined than early versions of chunking theory suggested.
These refinements don't undermine the core insight. They deepen it. Expert perception is still qualitatively different from novice perception. Chunk libraries are still real and still consequential. The mechanism is simply more complex than a single theory can capture.
Discussion Questions
-
Chase and Simon's random position test was the decisive piece of evidence. Why was this control condition so important? What alternative explanations does it rule out?
-
The chapter describes expert perception as "seeing differently, not just seeing more." What are the practical implications of this distinction for someone trying to develop expertise?
-
The 10-year rule is sometimes interpreted to mean "anyone can become an expert with enough practice." But the research actually says something more nuanced. What does it say, specifically, and how does that nuance matter?
-
Consider a domain you'd like to develop expertise in. What would a "chunk" look like in that domain? What categories of pattern would a deep practitioner recognize that a novice would not?
-
De Groot found that grandmasters considered fewer candidate moves but chose better ones. What does this tell us about the relationship between breadth of consideration and quality of decision-making? When is it better to consider more options versus fewer?