Appendix G: A Timeline of Forensic Science — Advances, Failures, and Reforms
This timeline is meant to be read two ways at once, because the history of forensic science only makes sense if you hold both threads together. One thread is the long climb of genuine scientific advance — the microscope, the fingerprint, the discovery that DNA varies between people and can be read from a few cells. The other thread, running right alongside it and usually a step behind, is the slow, painful reckoning with everything the field got wrong: the methods that were never validated, the confident experts who were certainly mistaken, the laboratories that cut corners, and the innocent people who paid for all of it. A chronology that tells only the first thread is a recruiting brochure. A chronology that tells only the second is a polemic. The real story is the two of them braided, because the same field produced DNA profiling and bite-mark testimony, and learning to tell those apart is the whole point of the book this appendix closes.
Dates here are given precisely where they are well established and qualitatively where a precise date would imply more certainty than the record supports. The landmark reports and court decisions — the NAS 2009 report, the 2016 PCAST report, Frye, Daubert, Kumho Tire, Melendez-Diaz — and the named, publicly documented cases are matters of public record and are cited as such throughout the book. Where a "first" is claimed in the popular literature but contested by historians, the entry says so rather than pretending to a precision the evidence has not earned.
A note on reading the two threads. Watch how often a celebrated advance and a catastrophic failure sit only a few years apart. The 1980s gave us both DNA profiling — the one rigorously quantified method in the whole field — and the high-water mark of bite-mark testimony, which the same courts admitted with the same confidence. The lesson is not that forensic science is fraudulent. It is that prestige and validity are different things, and the field spent most of its history unable, or unwilling, to tell them apart. That is the yardstick — where a method sits on the validity spectrum, and how we know — that the modern reckoning finally forced into the open.
The long foundations (antiquity to the 1800s)
The idea that physical traces can answer a legal question is old, but the science of it is young. For most of human history, guilt was decided by confession, witness, oath, ordeal, or torture — not by evidence read from the scene. The shift from those methods to the analysis of physical traces is, in the long view, what "forensic science" actually names, and it happened mostly in the last two centuries.
- Antiquity through the medieval period. Scattered anticipations appear — early medicolegal manuals in imperial China describing how to distinguish a drowning from a strangulation, Roman and medieval inquests into suspicious death — but these are precursors, not a discipline. They establish the question (what does the body tell us?) long before there is a method to answer it reliably.
- The nineteenth century: chemistry enters the courtroom. As analytical chemistry matured, it became possible to test for a poison rather than infer it from symptoms. The development of reliable chemical tests for arsenic in the early-to-mid 1800s marks one of the first times a laboratory method, rather than a witness, settled a question of fact in a criminal trial — the deep ancestor of the toxicology in Chapter 20. Across the same century, the systematic study of the body after death matured into forensic pathology, and the comparison microscope and early questioned-document work began to take shape.
- Anthropometry — the first attempt at identification. In the 1880s, a French records clerk devised a system of standardized body measurements to identify repeat offenders, the first serious attempt to give each person a unique, recordable identity for the justice system. It worked imperfectly and was soon eclipsed — but it established the ambition that fingerprints and later DNA would fulfill: a way to tie a trace to one individual.
The through-line of the foundations is the slow replacement of authority ("a respected witness says so") with method ("a test shows so"). That replacement is the great achievement of forensic science. The rest of this timeline is the discovery that a method is only as good as its validation — a lesson the field kept having to learn the hard way.
The classical era: principles and pattern methods (c. 1890s–1950s)
This is the era that built the disciplines the public still pictures, and it is also where the field acquired several methods it would spend the next century trying to validate after the fact. The pattern was set early: a technique would be introduced, prove useful in casework, gain courtroom acceptance through sheer familiarity, and only much later be asked the awkward question — what is your error rate, and how do you know?
- Fingerprint identification takes hold (around the turn of the twentieth century). Building on nineteenth-century work establishing that friction-ridge patterns are persistent and highly variable between individuals, fingerprinting was adopted by police agencies as a means of identification in the first decades of the 1900s and quickly displaced anthropometry. It became, and long remained, the "gold standard" — a status Chapter 14 examines critically, because the underlying comparison is a human judgment, not the automatic certainty television implies.
- Locard and the exchange principle (early 1900s). Edmond Locard, who established an early forensic laboratory in Lyon, France, articulated the principle that underlies the entire field: every contact leaves a trace. Locard's exchange principle (Chapter 3) is the reason there is such a thing as physical evidence to collect at all — and the honest modern reading adds the caution that a trace's presence is not the same as a trace's meaning.
- The comparison microscope and firearms identification (1920s). The development of the comparison microscope let examiners view two specimens side by side in a single optical field, enabling the side-by-side comparison of bullets and cartridge cases that founded firearms and toolmark examination (Chapter 15). The method's real power — and the limits of "matching" striations to a single weapon — would not be rigorously examined for decades.
- The institutional crime laboratory (1920s–1930s). Police crime laboratories were established in this period in Europe and the United States, including the founding of the U.S. Federal Bureau of Investigation's laboratory in the 1930s. The crime lab as an institution — with its sections, its workflow, and eventually its accreditation and its scandals (Chapter 4) — dates from here.
- The Frye standard (1923). In Frye v. United States, a federal appeals court rejected an early lie-detector technique and articulated the rule that scientific evidence is admissible only if the method has gained "general acceptance" in its relevant field. The Frye standard (Chapter 5) governed the admissibility of scientific evidence in much of the United States for decades and still governs in some jurisdictions. Its weakness, visible in hindsight, is that general acceptance measures a method's popularity among its own practitioners, not its validity — which is exactly how unvalidated pattern methods stayed in court so long.
By the end of this era the field had its disciplines, its laboratories, and its courtroom gate. What it did not have — and would not seriously demand for another half-century — was a habit of asking each method to prove, with measured error rates, that it could do what it claimed.
The molecular revolution: DNA (1980s–1990s)
Then came the one genuinely transformative advance, and the book treats it as the field's emblem of real progress for good reason. DNA profiling is the rare forensic method built from the start on rigorous, quantifiable science, and its arrival reset the standard for what "identification" should mean.
- DNA profiling is developed (mid-1980s). Work in the United Kingdom established that variation in human DNA could be read as an individual-specific profile, and the technique entered casework in the second half of the 1980s. Crucially, DNA came with something no prior pattern method had: a way to state how rare a profile is — the random match probability (RMP) of Chapter 7 — grounding a "match" in a number rather than an examiner's confidence.
- The first DNA exoneration and the first DNA conviction (late 1980s). In the United Kingdom, the first use of DNA evidence in a criminal case both cleared an innocent suspect and then identified the actual perpetrator — a single early case that captures the book's whole thesis: the technology's first power is to exclude. The United States saw its first convictions secured by DNA evidence in the same period, and its first death-row DNA exoneration (Kirk Bloodsworth) at the start of the 1990s.
- The "DNA wars" and the standardization that followed (late 1980s–1990s). Early DNA evidence faced serious, legitimate scientific challenges over methods, statistics, and laboratory practice. The field's response is what distinguishes it: rather than circling the wagons, it produced standardized methods, validated protocols, and population-genetics frameworks, including authoritative reviews by the National Research Council. DNA became more rigorous because it was challenged — the opposite of what happened to the pattern methods that were never seriously tested.
- The shift to STR analysis and the founding of CODIS (1990s). The field standardized on short tandem repeat (STR) analysis (Chapter 7), and the U.S. Combined DNA Index System (CODIS) — the national database linking DNA profiles across cases and jurisdictions — was established in the 1990s, enabling database "cold hits" that became a central investigative tool.
DNA did two things to the rest of forensic science. It gave the field its one fully validated method and its top-of-the-spectrum benchmark. And — because DNA could be tested against old convictions — it became the instrument that exposed how unreliable much of the rest of the field had been. The revolution and the reckoning are the same event seen from two sides.
The reckoning: exonerations and the reports (1990s–2010s)
Once DNA could re-test old cases, it began returning a verdict on forensic science itself, and the verdict was damning. The exonerations were not random; they revealed a pattern of causes, and that pattern forced two landmark scientific reviews that the field is still reckoning with.
- The Innocence Project and the wave of DNA exonerations (founded 1992, growing through the 2000s–2010s). The Innocence Project and the broader Innocence Network used post-conviction DNA testing to overturn hundreds of wrongful convictions in the United States. The exonerations were a natural experiment: in case after case, DNA revealed what had actually gone wrong, and the recurring causes (Chapter 6, Chapter 34) included mistaken eyewitness identification, false confessions, and — in a large share — flawed or overstated forensic testimony. The field could no longer treat its errors as isolated; the data showed a systemic problem.
- Pattern methods exposed by DNA. Specific disciplines were discredited or sharply curtailed as DNA re-testing exonerated people convicted on their say-so. Bite-mark comparison (Chapter 16) was implicated in multiple exonerations (including the cases of Ray Krone and Roy Brown) and is now the field's most thoroughly debunked discipline. Microscopic hair comparison (Chapter 19) collapsed as a source of individualization; in the 2010s the FBI publicly acknowledged that examiners had, over a long period, given testimony that overstated what microscopic hair analysis could support — a stunning institutional admission.
- The 2009 NAS report — the field's diagnosis. Strengthening Forensic Science in the United States: A Path Forward, produced by the National Academy of Sciences (the NAS 2009 report, Chapter 6), found the field "badly fragmented" and concluded that, with the notable exception of DNA analysis, no forensic method had been rigorously shown to consistently and reliably demonstrate a connection between evidence and a specific individual or source. It called for independent oversight, mandatory accreditation and certification, and — above all — research to establish the validity and error rates of the pattern disciplines. It is the single most important document in the field's modern history.
- The 2016 PCAST report — the validity test made explicit. The President's Council of Advisors on Science and Technology report Forensic Science in Criminal Courts: Ensuring Scientific Validity of Feature-Comparison Methods (the 2016 PCAST report, Chapter 6) sharpened the NAS critique into a specific standard: foundational validity, established by appropriately designed empirical studies with measured error rates. Applying that test, PCAST found that some methods (single-source DNA) met it, others (such as latent fingerprint analysis) had only limited foundational validity, and several (including bite-mark comparison and aspects of firearms/toolmark identification) lacked it. PCAST gave courts and analysts the explicit yardstick the book uses throughout.
- Standards infrastructure: OSAC (mid-2010s). Partly in response to the NAS report, the National Institute of Standards and Technology established the Organization of Scientific Area Committees for Forensic Science (OSAC) (Chapter 38) to develop and curate documentary standards across the disciplines — an institutional attempt to build the validation and standardization the reports demanded.
The reckoning is not a story of the field being attacked from outside. The exonerations were powered by a forensic method (DNA), the reports were written by scientists, and the most damning admissions (the FBI hair-comparison review) came from inside. That is the honest shape of it: forensic science's own best tools exposed forensic science's worst practices.
The laboratories themselves: scandals and quality (ongoing)
Running underneath the science is a separate, sobering thread — the failures not of methods but of laboratories and the people in them. A valid method run dishonestly or incompetently produces confident, wrong results at scale, and the case record contains several disasters that tainted tens of thousands of convictions at once (Chapter 4, Chapter 38).
- Serology and laboratory fraud (1980s–1990s onward). Cases such as the West Virginia serology scandal associated with Fred Zain showed that a single analyst, fabricating or overstating results over years, could corrupt enormous numbers of cases — and that the institutions around the analyst often failed to catch it.
- The Massachusetts drug-lab scandals (early 2010s). Two separate state chemists — Annie Dookhan, who "dry-labbed" by reporting results for tests she never performed, and Sonja Farak, who was tampering with and using lab drug samples — together tainted tens of thousands of criminal cases, leading to mass dismissals of convictions. These are the book's standing examples (Chapter 4) that the lab, not just the method, sets the ceiling on evidence quality.
- The quality-systems response. Across the same decades the field built the machinery meant to prevent exactly these failures: accreditation to standards such as ISO/IEC 17025, formal quality assurance and quality control, proficiency testing, and method validation (Chapter 4). The reforms are real and they matter — and, as Chapter 38 argues, they remain unevenly adopted, and the deeper structural problem (most crime labs sit administratively inside law-enforcement agencies) is largely unaddressed.
The lesson of this thread is the one the whole book repeats: validity lives in the method and the practice, never in the method alone. The most rigorous technique in the world, run by a dishonest or unchecked analyst, is a wrongful-conviction engine.
The genealogy era and the present (2018–present)
The most recent chapter of the advance thread is, fittingly, a return to DNA — but DNA used in an entirely new way, and the book's emblem of validated modern progress.
- Investigative genetic genealogy and the Golden State Killer (2018). Investigators identified Joseph James DeAngelo as the Golden State Killer by uploading a crime-scene DNA profile to a consumer-style genealogy database, building family trees from distant relatives, and narrowing to a single suspect (Chapters 8, 29, 39). Investigative genetic genealogy (IGG) broke open numerous cold cases in the years that followed and stands, in the book's framing, as the contemporary proof that real forensic progress is possible — while raising serious, unresolved questions about genetic privacy (Chapter 38) that the field is still working through.
- Rapid DNA, probabilistic genotyping, and AI (present). Rapid DNA instruments compress profiling from weeks to hours; probabilistic genotyping software brings statistical rigor — and a "black-box" problem — to the hard problem of DNA mixtures (Chapter 9); and machine-learning tools are entering several disciplines (Chapter 29). Each is promising, and each is being met — at least by the field's honest practitioners — with the same question the reports taught us to ask of everything new: what is the error rate, and how do we know?
- The unfinished reform. The present moment is defined by a gap. The diagnosis (NAS 2009) and the prescription (PCAST 2016) exist; the standards body (OSAC) exists; the validated method (DNA, now IGG) exists. What remains incomplete is implementation — independence for crime labs, blind verification, mandatory accreditation and certification, and the routine use of context management to control bias (Chapters 31, 38). The history of forensic science is, at this writing, a story whose reforms are known and only partly carried out.
How to read this timeline
Step back and look at the braid one last time. The advance thread is genuine and worth celebrating: from chemical tests for poison, to fingerprints, to the comparison microscope, to DNA, to investigative genetic genealogy, the field has built real tools that exclude the innocent and identify the guilty. The failure thread is equally real and must never be sanded off: unvalidated pattern methods admitted for decades on "general acceptance," bite marks and microscopic hair comparison that helped convict the innocent, lab scandals that tainted tens of thousands of cases, and a reform agenda still only half-implemented.
The single habit this timeline should leave you with is the one the whole book teaches: do not confuse a method's age, prestige, or courtroom familiarity with its validity. Fingerprinting is old and useful and also a human judgment under a real error rate. DNA is newer and more rigorous precisely because it was challenged and tested. The question that sorts the advances from the failures is never "how long have we used it?" or "do experts accept it?" It is the question the NAS and PCAST reports finally forced into the open, and the question you should now ask of every method in this history and every method still to come: how do we know it works, and how strong, really, is what it can establish?