Case Study 02: Wars, Pandemics, and City Sizes — Power Laws of Human Systems

Case Study 02: Wars, Pandemics, and City Sizes — Power Laws of Human Systems

Context: This case study accompanies Chapter 4 (Power Laws and Fat Tails). It examines three power law distributions in distinctly human domains — armed conflict, disease transmission, and urban growth — and asks what the recurrence of the same mathematical pattern across these domains tells us about the deep structure of human systems.

I. Richardson's Ghost: The Power Law of War

A Quaker Counts the Dead

Lewis Fry Richardson was a man of contradictions. A devout Quaker and pacifist, he served as an ambulance driver on the Western Front in World War I, witnessing firsthand the industrialized slaughter of the trenches. A mathematician and physicist by training, he made foundational contributions to weather forecasting and turbulence theory. And after the war, he turned his mathematical mind to the question that haunted him for the rest of his life: could the causes of war be understood quantitatively?

Richardson spent decades compiling a database of every fatal human conflict he could find records of, from the smallest recorded skirmishes (with as few as three deaths) to the two World Wars (with deaths in the tens of millions). He published his findings posthumously in 1960, in a book titled Statistics of Deadly Quarrels.

What Richardson found was a pattern that most people, then and now, find deeply disturbing. When he plotted the number of conflicts at each death toll level — how many conflicts killed between 1 and 10 people, how many killed between 10 and 100, between 100 and 1,000, and so on — the result was a power law. On a log-log plot, the data formed a reasonably straight line.

The implication was stark: there is no qualitative boundary between small violence and catastrophic violence. The bar fight, the riot, the civil war, and the world war sit on the same statistical continuum. The frequency of wars with a given death toll decreases as a power of that death toll, following the same mathematical form as earthquake frequencies, city sizes, and book sales.

What the Power Law Does and Does Not Say

It is essential to be precise about what Richardson's power law means and what it does not.

It does not mean that wars are random. Every war has specific causes — political, economic, ethnic, territorial. Richardson's law does not erase those causes; it describes the aggregate statistical pattern that emerges when you look at all conflicts together.

It does not mean that large wars are inevitable. The power law tells you the probability of a war of a given size, given the observed frequency of smaller wars. A war that kills 100 million people is very unlikely in any given decade — but it is not the infinitesimally unlikely event that a Gaussian model would predict. The fat tail gives it a real, if small, probability.

It does mean that our sense of safety is exaggerated. When people point to the decades since World War II and argue that large-scale war is a thing of the past, Richardson's power law issues a caution. The fact that we have not experienced a conflict on the scale of World War II in 80 years is consistent with the power law prediction — such events are rare — but it does not mean the underlying probability has changed. A seismologist would not look at 80 years without a magnitude-9 earthquake in a given region and conclude that magnitude-9 earthquakes are no longer possible.

The Mechanism: Cascading Feedback in Networks of Alliances

What generates the power law in war casualties? The answer connects directly to the feedback loops of Chapter 2 and the network structures that generate power laws through preferential attachment.

Wars do not start at their final scale. World War I began as a regional dispute between Austria-Hungary and Serbia. It escalated to continental and then global scale through a network of alliances and obligations: Russia mobilized to support Serbia; Germany mobilized to support Austria-Hungary; France was drawn in by its alliance with Russia; Britain was drawn in by its commitment to Belgian neutrality. Each new entrant increased the scale of the conflict, which created pressure for further escalation.

This is a cascading feedback process, structurally similar to the cascading failures we discussed in Chapter 2 (the fire-sale spiral in the 2008 financial crisis) and to the preferential attachment of Chapter 4 (larger conflicts draw in more participants, which makes them larger still). The power law in war casualties emerges because the probability of a conflict escalating to the next scale depends on the interconnectedness of the system — the density of the alliance network, the degree of economic interdependence, the contagion of fear and hostility through communication channels.

In a world with few international connections, most conflicts would remain small. In a highly interconnected world, a small conflict has a greater probability of cascading through the network and becoming large. The power law exponent may depend on the degree of global interconnection — a hypothesis with sobering implications for our increasingly connected world.

II. Pandemics: The Fat Tail That Keeps Returning

The Uneven Spread

The chapter's discussion of superspreaders introduced the idea that pandemic transmission follows a fat-tailed distribution. This case study examines the pattern in greater depth.

Consider the classic epidemiological model: each infected person transmits the disease to R0 others, on average. For COVID-19, R0 was estimated at 2-3, meaning each infected person infected, on average, 2 to 3 others. This average, however, conceals a distribution that is strikingly unequal.

Studies of COVID-19 transmission found that approximately 10-20 percent of infected individuals were responsible for 80 percent or more of secondary infections. Many infected people transmitted the virus to nobody at all. A few transmitted it to dozens — in choir practices, in meat-packing plants, at wedding receptions, in crowded bars. The distribution of individual transmission events (the number of secondary infections caused by each infected person) followed a fat-tailed distribution, often described by a negative binomial with a high dispersion parameter — mathematically similar to a power law for the purposes of understanding tail risk.

Why Superspreaders Matter More Than R0

The existence of superspreader dynamics transforms the strategic landscape of pandemic response.

If transmission were Gaussian — if every infected person infected approximately the same number of others — then the optimal response would be to reduce everyone's transmission by a modest, uniform amount. Universal masking, social distancing, and hygiene measures would be the primary tools, and their effects would be roughly proportional to their adoption.

But if transmission is fat-tailed, the picture changes dramatically. The superspreader events in the fat tail contribute so much to overall transmission that targeting them specifically — banning large indoor gatherings, improving ventilation in high-risk settings, implementing backward contact tracing to identify superspreading events — can reduce total transmission far more efficiently than uniform measures applied to the entire population.

This is the public health analogue of the principle from the chapter: in Extremistan, target the tail. The average transmission event is not the one driving the pandemic. The rare, extreme transmission events are. Policy designed for the average case will be inefficient; policy designed for the tail case will be effective.

Japan's response to COVID-19 in its early stages was informed by exactly this insight. Rather than focusing exclusively on forward contact tracing (finding who the infected person might have exposed), Japanese epidemiologists emphasized backward contact tracing — asking where the infected person had been exposed, and identifying the settings and events that produced clusters of transmission. This approach targeted the superspreader events directly, and Japan achieved notable success in controlling early transmission despite minimal lockdown measures.

The Historical Pattern: Pandemics as Fat-Tailed Events

Zoom out from individual transmission events to the scale of entire pandemics, and the fat tail appears again.

The history of infectious disease is dominated by a handful of catastrophic events. The Black Death of 1347-1351 killed an estimated 30-60 percent of Europe's population. The 1918 influenza pandemic killed an estimated 50-100 million people. The HIV/AIDS pandemic has killed more than 40 million people since the 1980s. These events are separated by long periods of relative epidemiological calm, punctuated by smaller outbreaks that, while serious, are orders of magnitude less devastating.

The distribution of pandemic severity — the number of deaths caused by each pandemic event — appears to follow a fat-tailed distribution, though the small sample size of truly catastrophic pandemics makes precise statistical characterization difficult. What is clear is that the history of pandemics is not well described by a Gaussian. It is not a smooth bell curve of equally spaced events. It is a long, quiet baseline punctuated by rare but devastating spikes — exactly the pattern a fat-tailed distribution predicts.

The implications for pandemic preparedness are the same as for earthquake engineering: prepare for the tail, not the historical average. The fact that no pandemic as deadly as the 1918 influenza has occurred in more than a century does not mean such an event is implausible. The fat tail of the distribution suggests that another catastrophic pandemic is not a matter of if but of when — and that our preparations should be calibrated to the tail, not to the typical.

III. Cities: The Power Law of Human Gathering

Zipf's Law in Urban Form

The chapter introduced Zipf's law for city sizes — the observation that the largest city in a country is roughly twice the size of the second-largest, three times the third-largest, and so on. This case study examines the mechanism more closely and considers its implications for urban planning and policy.

Zipf's law for cities is one of the most robust empirical regularities in the social sciences. It has been observed in virtually every country with reliable population data, across different historical periods, and at different levels of economic development. The regularity is so consistent that deviations from Zipf's law are themselves informative — they often indicate unusual historical circumstances (such as a country with a single dominant primate city, like France with Paris, or a country with an artificially divided capital, like Germany during the Cold War).

Why Cities Follow Power Laws

The mechanism that generates the power law in city sizes is a form of preferential attachment, operating through multiple reinforcing feedback loops:

Economic agglomeration. Large cities offer more diverse job markets, more specialized services, deeper pools of talent, and more opportunities for economic exchange. These advantages attract workers and businesses, which make the city larger, which deepens the advantages further. This is the urban economist's concept of "agglomeration economies" — and it is preferential attachment in economic form.

Infrastructure and institutions. Large cities have more developed infrastructure (transportation networks, hospitals, universities, cultural institutions) than small ones. This infrastructure attracts further investment and migration, which supports further infrastructure development. Another reinforcing loop.

Network effects. Cities are, among other things, networks of social connections. A larger city offers more potential connections — more potential friends, partners, collaborators, customers, and employers. The value of being in a city increases with the city's size, which attracts more people, which increases the size further. This is Metcalfe's law (the value of a network is proportional to the square of the number of nodes) intersecting with preferential attachment.

Cultural gravity. Large cities develop cultural identities, reputations, and brands that attract further migration. New York is "the city that never sleeps"; London is a global financial center; Tokyo is a technological hub. These narratives are self-reinforcing: they attract the people who make them true, and the truth of the narrative reinforces its attractiveness.

Each of these mechanisms is a positive feedback loop. Together, they produce a system where large cities grow faster than small cities — not because they are "better" in any absolute sense, but because their existing size generates advantages that attract further growth. This is preferential attachment operating in urban space, and it generates a power law distribution of city sizes with the same mathematical inevitability that preferential attachment generates power law distributions in networks, wealth, and book sales.

The Constraints: Why Cities Do Not Grow Without Limit

If preferential attachment were the only force, the power law predicts that the largest city would grow without bound, eventually absorbing the entire population. This does not happen, because balancing feedback loops (Chapter 2) constrain urban growth:

Congestion and cost of living. As a city grows, commute times increase, housing costs rise, pollution worsens, and the quality of life for many residents declines. These diseconomies of scale push people away from the largest cities, counteracting the pull of agglomeration.

Policy and planning. Governments sometimes actively limit the growth of large cities (through zoning laws, green belts, investment in regional cities) or promote the growth of smaller ones.

Diminishing returns. At some point, the marginal benefit of a city's additional size diminishes — the ten-millionth resident adds less to the city's agglomeration advantage than the millionth resident did.

The power law emerges from the balance between the reinforcing loops of agglomeration and the balancing loops of congestion and diminishing returns. The exponent of the power law — how steep the distribution is, how dominant the largest cities are relative to the smaller ones — depends on the relative strength of these competing forces.

This is a direct application of the interacting-loops framework from Chapter 2. The city size distribution is not the result of a single force but of multiple feedback loops operating simultaneously, some reinforcing and some balancing. The power law is the emergent outcome of this dynamic equilibrium.

Implications for Policy

Understanding that city sizes follow a power law — and understanding the feedback mechanisms that generate it — has important implications for urban policy.

Attempts to create "balanced" city systems are fighting the mathematics. Governments that try to prevent the growth of large cities and divert migration to smaller ones are pushing against a powerful positive feedback loop. Such policies can work at the margin, but they cannot transform a power law distribution into a Gaussian one. The fundamental dynamics of agglomeration ensure that some cities will always be dramatically larger than others.

Infrastructure planning should account for the tail. Just as earthquake engineering should design for the rare catastrophic event, urban infrastructure planning should account for the possibility that the largest cities will be much larger than simple extrapolation from current trends suggests. Megacity growth in the developing world — Lagos, Dhaka, Kinshasa — is following the power law trajectory, and infrastructure planning that assumes moderate, Gaussian-like growth will be overwhelmed.

The long tail of small cities matters. Just as Chris Anderson's "long tail" revealed the aggregate economic value of niche products, the long tail of small cities and towns contains enormous aggregate population and economic value. Policies that focus exclusively on the handful of largest cities overlook the majority of the population living in the tail.

IV. The Unifying Thread

Three Domains, One Pattern

Wars, pandemics, and city sizes have nothing in common in their substance. Wars are political. Pandemics are biological. Cities are geographic and economic. And yet all three follow the same mathematical distribution: a power law with fat tails, where extreme events are rare but dominant, and where the average is a misleading guide to the reality.

The pattern recurs because the underlying process recurs. In all three domains:

There is a system of interconnected elements — nations in an alliance network, individuals in a contact network, people in an economic network.
There is a positive feedback mechanism — escalation cascades in war, superspreader dynamics in pandemics, agglomeration economies in cities.
The feedback amplifies initial advantages or perturbations — a small conflict draws in allies, a single superspreader infects dozens, a slightly larger city attracts disproportionate migration.

The result, in every case, is a distribution where the extreme events — the world wars, the global pandemics, the megacities — are not aberrations but the inevitable tail of a process that operates at every scale.

This is the thesis of Chapter 4, illustrated in the domain of human systems: the same mathematical curve appears wherever positive feedback operates in a growing, interconnected system, regardless of what the system is made of or what it does.

Discussion Questions

Richardson's power law suggests that the probability of a catastrophic war, while small, is nonzero and cannot be assumed to be decreasing over time. How should this inform nuclear weapons policy? What is the difference between "no large war has occurred in 80 years" and "large wars are no longer possible"?

Japan's backward contact tracing strategy during COVID-19 targeted superspreader events specifically. What other policies or interventions could specifically target the fat tail of pandemic transmission? How does targeting the tail differ from uniform population-wide measures?

Some countries (like France) have a single dominant "primate city" (Paris), while others (like Germany) have a more distributed city size distribution. How might historical factors (centralization of government, war, geography) affect the exponent of the power law for city sizes?

The case study argues that all three domains — war, pandemics, cities — share the structural features of interconnected elements, positive feedback, and amplification of initial advantages. Can you identify these three features in a fourth domain not discussed in this case study? How does the power law manifest there?

What are the ethical implications of Richardson's discovery? If catastrophic wars are a statistical feature of the international system rather than the result of individual evil, does this change how we think about responsibility, prevention, and the possibility of lasting peace?