Preface
Why Another Statistics Textbook?
Let's be honest: you're probably holding this book (or scrolling through it on a screen) because someone told you that you have to take a statistics course. Maybe it's a requirement for your major. Maybe your advisor said something about "quantitative reasoning" and you nodded politely while internally panicking.
If that's you, take a breath. You're in the right place.
Here's the thing about statistics that most textbooks won't tell you upfront: you already think statistically. Every time you check the weather forecast and decide whether to bring an umbrella, you're making a decision under uncertainty. Every time you read a product review and wonder whether to trust it, you're evaluating a sample. Every time you hear a news headline claiming that some food causes cancer and you think "really?", you're doing informal hypothesis testing.
This book is about turning that informal, gut-level statistical reasoning into something precise, powerful, and trustworthy.
Who This Book Is For
This textbook was written for students taking their first statistics course — especially those who consider themselves "not math people." If any of the following describe you, you're our target reader:
- You're majoring in psychology, nursing, education, business, criminal justice, sociology, biology, communications, or another field that requires a statistics course
- Your last math class was a while ago, and you're a little nervous about it
- You can do basic algebra (solve for x, read a graph, work with percentages), but calculus is not part of your vocabulary — and that's perfectly fine
- You've never written a line of code before
- You want to understand the data-driven world around you, even if you never planned to become a data scientist
If you're a math or CS major looking for a rigorous, proof-based introduction to mathematical statistics, this book isn't designed for you (though you might enjoy the applications and the AI connections). We recommend a text like Mathematical Statistics with Applications by Wackerly, Mendenhall, and Scheaffer for that audience.
How This Book Is Different
We Lead with Intuition, Not Formulas
Every concept in this book starts with a concrete, relatable example. We build your intuition first, then show you the formula. The formula is never the point — the understanding is. When you encounter a formula, it will always be accompanied by a plain-English explanation of what it means and why it works.
We're Honest About What We Don't Know
Statistics is fundamentally about uncertainty, and we practice what we preach. When a concept has common misunderstandings (like what a p-value actually means), we address them directly. When there's genuine debate in the field (like the value of statistical significance), we present multiple perspectives honestly.
We Use Real Tools
You'll learn to do statistics with both Python and Excel/Google Sheets. Python is the industry standard for data analysis, and we use beginner-friendly Jupyter notebooks to make it approachable even if you've never coded before. Excel is included because many of you will use spreadsheets in your careers. Both are practical — neither is a toy.
You'll Leave with a Portfolio
The progressive project running through this book has you analyze a real public dataset from start to finish. By the final chapter, you'll have a polished data analysis report in a Jupyter notebook — something you can show to a potential employer, graduate program, or anyone who asks "so what did you learn in that stats class?"
We Connect Statistics to the World You Live In
Algorithms decide what you see on social media, whether you're approved for a loan, and how your city deploys its police officers. These systems run on statistics. Understanding statistics means understanding the systems that shape your life — and being equipped to push back when those systems get it wrong.
How to Use This Book
See the detailed guide in the next section ("How to Use This Book"), but here's the short version:
- Read actively. Answer the "Check Your Understanding" prompts. Do the "Productive Struggle" exercises. Predict outcomes before reading the answers. Your brain learns by doing, not by passively absorbing.
- Work the project. The progressive portfolio project is designed to make everything concrete. Don't skip it.
- Don't skip the exercises. Statistics is like swimming — you can't learn it by watching someone else do it.
- Use the learning paths. If you already know some material, the 🏃 Fast Track tells you what to skip. If you want more depth, follow the 🔬 Deep Dive pointers.
- Come back to earlier chapters. The spaced review sections at the end of each chapter revisit earlier material for a reason — spaced repetition is one of the most effective learning strategies ever documented.
A Note on the AI Dimension
The subtitle of this book — "Making Sense of Data in the Age of AI" — isn't just marketing. It reflects a conviction: that statistical literacy is more important now than at any point in human history.
AI systems are making decisions that affect real people's lives, and those systems are built on statistical foundations. A recommendation algorithm is a regression model. A spam filter is a classifier. A facial recognition system relies on probability distributions and Bayesian updating.
You don't need to become an AI engineer to benefit from understanding this. You just need to be a critical consumer of data — someone who can ask the right questions when someone tells you "the algorithm says so."
That's what this book teaches you to become.
Acknowledgments
This textbook was generated with AI assistance using Claude (Anthropic) and reviewed for accuracy. It stands on the shoulders of excellent statistics textbooks that came before it, particularly the work of Mario Triola, David Moore, George Cobb, and the OpenStax team. The pedagogical approach draws on research in learning science by Robert Bjork, Elizabeth Bjork, Henry Roediger, and Mark McDaniel.
Any errors are our own (or the AI's — we've tried to catch them all, but we appreciate readers who report issues).
Open Source
This textbook is released under a Creative Commons Attribution-ShareAlike 4.0 International License (CC BY-SA 4.0). You are free to share, adapt, and build upon this work, even commercially, as long as you give appropriate credit and distribute derivative works under the same license.
We believe that access to high-quality educational materials should not be limited by cost. If this book helps you learn statistics, consider contributing back: fix a typo, suggest a better example, translate a chapter, or simply tell a classmate about it.
Let's get started.