Further Reading: Communicating with Data: Telling Stories with Numbers
Books
For Deeper Understanding
Edward R. Tufte, The Visual Display of Quantitative Information, 2nd edition (2001) The foundational text on data visualization design. Tufte introduces the data-ink ratio, chartjunk, small multiples, and sparklines in this beautifully produced volume. The book is as much a work of art as it is a textbook — Tufte practices what he preaches, with elegant page layouts and carefully chosen examples spanning centuries of visual communication. Every chart, map, and diagram in the book demonstrates a principle being discussed. If you read one book on data visualization in your life, make it this one.
Edward R. Tufte, Envisioning Information (1990) Tufte's second book extends his principles to complex information displays: maps, timetables, medical graphics, and computer interfaces. The chapter on "small multiples" is the definitive treatment. The chapter on "layering and separation" — using color, value, and position to separate different types of information on the same display — is directly applicable to the multi-variable charts we create in statistical analysis.
Edward R. Tufte, Beautiful Evidence (2006) Tufte's most recent full-length book focuses on how visual evidence is presented and consumed. The section on PowerPoint's role in the Columbia space shuttle disaster — where a critical engineering concern was buried in a busy, nested bullet-point slide — is a sobering case study in how bad communication can have life-or-death consequences. Required reading for anyone who presents data in slide format.
Cole Nussbaumer Knaflic, Storytelling with Data: A Data Visualization Guide for Business Professionals (2015) Where Tufte is theoretical and artistic, Knaflic is practical and corporate. This book is a step-by-step guide to creating business-quality charts, with before-and-after makeovers that mirror the approach in this chapter. Chapters on "choosing an effective visual," "clutter is your enemy," and "tell a story" provide a clear framework for professionals who need to present data to non-technical stakeholders. Highly recommended for the business-oriented reader.
Cole Nussbaumer Knaflic, Storytelling with Data: Let's Practice! (2019) The companion workbook to the above, with over 100 hands-on exercises. If you liked the chart revision exercises in this chapter's exercise set, this book provides hundreds more opportunities to practice identifying problems and redesigning visualizations.
Alberto Cairo, The Truthful Art: Data, Charts, and Maps for Communication (2016) Cairo provides a comprehensive guide to data visualization that balances aesthetics with statistical rigor. His five qualities of great visualizations — truthful, functional, beautiful, insightful, and enlightening — provide a useful framework that goes beyond Tufte's focus on minimalism. The chapter on "The Truthfulness Continuum" explores the ethical territory between outright fabrication and subtle framing — the same territory covered in this chapter's ethical analysis block.
Alberto Cairo, How Charts Lie: Getting Smarter about Visual Information (2019) A more accessible companion to The Truthful Art, focused specifically on misleading visualizations. Cairo catalogs the ways charts can mislead — through poor design, dubious data, insufficient context, or deliberate deception — and teaches readers to detect each one. The examples are drawn from recent politics, media, and science, making them immediately relevant. This is the "consumer" side of data visualization literacy.
For the Conceptually Curious
Claus O. Wilke, Fundamentals of Data Visualization: A Primer on Making Informative and Compelling Figures (2019) A modern, comprehensive guide available free online (clauswilke.com/dataviz/). Wilke covers every chart type with clear guidelines for when to use each one, how to choose color scales, how to handle overlapping data, and how to design for accessibility. The chapter on "Visualizing Uncertainty" is particularly relevant to this chapter's emphasis on honest communication.
William S. Cleveland, The Elements of Graphing Data, 2nd edition (1994) Cleveland's research on human perception of graphical elements — the studies showing that we judge position along a common scale more accurately than angle or area — provides the empirical foundation for many of this chapter's recommendations. The hierarchy of visual perception (position > length > angle > area > color > volume) is one of the most important concepts in data visualization.
Jonathan Schwabish, Better Data Visualizations: A Guide for Scholars, Researchers, and Wonks (2021) Targeted at academic and policy audiences, this book addresses the specific challenges of visualizing research results. The sections on presenting regression results, showing distributions, and designing for print vs. presentation are directly applicable to the skills practiced in this chapter.
Articles and Papers
Cleveland, W. S., and McGill, R. (1984). "Graphical Perception: Theory, Experimentation, and Application to the Development of Graphical Methods." Journal of the American Statistical Association, 79(387), 531-554. The landmark empirical study showing that humans judge position along a common scale more accurately than length, angle, slope, area, or color. This research provides the scientific basis for preferring bar charts over pie charts and position-based encodings over area-based ones. The paper is technical but the core findings are clearly presented in the figures.
Heer, J., and Bostock, M. (2010). "Crowdsourcing Graphical Perception: Using Mechanical Turk to Assess Visualization Design." Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, 203-212. An elegant replication and extension of Cleveland and McGill's 1984 study using Amazon Mechanical Turk. The authors confirm the original perception hierarchy and extend it to new chart types. This paper also demonstrates a modern approach to experimental research on visualization effectiveness.
Schwabish, J. A. (2014). "An Economist's Guide to Visualizing Data." Journal of Economic Perspectives, 28(1), 209-234. A practical guide to improving data visualization in academic economics papers, but the advice applies to any field. Schwabish provides concrete examples of common visualization mistakes in published papers and shows how to fix them. The before-and-after examples are particularly instructive.
Franconeri, S. L., Padilla, L. M., Shah, P., Zacks, J. M., and Hullman, J. (2021). "The Science of Visual Data Communication: What Works." Psychological Science in the Public Interest, 22(3), 110-161. A comprehensive review of the empirical research on data visualization effectiveness. The authors synthesize decades of cognitive psychology research to provide evidence-based guidelines for chart design. Particularly valuable for understanding why certain design choices work — the perceptual and cognitive mechanisms behind effective visualization.
Wong, B. (2011). "Points of View: Color Blindness." Nature Methods, 8(6), 441. A concise, one-page guide to colorblind-friendly design in scientific figures. Wong's recommended color palette (blue, orange, sky blue, bluish green, yellow, vermilion, reddish purple) is widely used in scientific publications and is the palette recommended in this chapter's accessibility section.
Kastellec, J. P., and Leoni, E. L. (2007). "Using Graphs Instead of Tables in Political Science." Perspectives on Politics, 5(4), 755-771. A compelling argument that graphs communicate statistical results more effectively than tables in most contexts. The authors convert published regression tables into graphs and demonstrate how much easier it is to understand the results visually. Relevant to the choice between regression tables and coefficient plots in research communication.
Online Resources
FlowingData (Nathan Yau) https://flowingdata.com/
A long-running blog and tutorial site by Nathan Yau, a statistician and data visualization designer. The "How to" section includes step-by-step tutorials for creating polished visualizations in R and Python. The "Mistakes" section catalogs real-world examples of misleading visualizations — an excellent supplement to this chapter's rogues' gallery.
Datawrapper Blog https://blog.datawrapper.de/
The engineering and design team behind Datawrapper (a popular chart-creation tool) publishes detailed articles on visualization best practices. Their posts on "What to Consider When Choosing Colors for Data Visualization," "How to Choose the Right Chart Type," and "When to Show Data as a Table Instead of a Chart" provide practical guidance grounded in research.
PolicyViz (Jonathan Schwabish) https://policyviz.com/
A resource focused on improving data communication in policy and research contexts. The podcast features interviews with data visualization practitioners, and the blog includes detailed chart makeovers, presentation tips, and guidelines for communicating statistical findings to policymakers — directly relevant to James's case study in this chapter.
Colorbrewer 2.0 https://colorbrewer2.org/
An interactive tool for selecting color schemes for maps and charts, developed by Cynthia Brewer. You can filter by colorblind-safe, print-friendly, and photocopy-safe, making it invaluable for creating accessible visualizations. Every palette includes guidance on the maximum number of distinguishable categories.
Matplotlib Documentation: "Making Professional Plots" https://matplotlib.org/stable/tutorials/introductory/customizing.html
The official matplotlib guide to customizing plots, including rcParams, stylesheets, and professional formatting. This documentation is the reference for the Python techniques covered in Section 25.18 of this chapter.
From Data to Viz https://www.data-to-viz.com/
An interactive decision tree that helps you choose the right chart type based on the type of data you have. Start with "one numeric variable," "two numeric variables," or "one numeric + one categorical," and follow the branches to a recommended chart type — with examples, code, and common pitfalls. A fantastic reference for students still building chart-selection intuition.
Reproducibility and Open Science Resources
The Turing Way: A Handbook for Reproducible Data Science https://the-turing-way.netlify.app/
A community-driven guide to reproducible research, covering version control, code review, testing, continuous integration, and more. The chapters on "Reproducible Environments" and "Research Data Management" go deeper than this chapter's reproducibility section, making it an excellent next step for students interested in research careers.
Wilson, G., et al. (2017). "Good Enough Practices in Scientific Computing." PLoS Computational Biology, 13(6), e1005510. A practical guide to computational reproducibility that strikes the right balance between ideal practices and what's actually achievable by working researchers. The recommendations — organize projects consistently, record dependencies, use version control, automate what you can — are the expanded version of the reproducibility checklist in this chapter.