Preface

"The more I learn about football, the more I realize how much I don't know." — Johan Cruyff


Why This Book Exists

Soccer is the world's most popular sport, watched and played by billions across every continent. In recent decades, a quiet revolution has transformed how the game is understood, analyzed, and played at the highest levels. Data analytics, once viewed with skepticism by traditionalists, has become an indispensable tool for clubs competing for the world's most prestigious trophies.

Yet despite this transformation, a significant gap has persisted between the cutting-edge work being done inside professional clubs and the educational materials available to aspiring analysts. Academic courses often teach general data science skills without soccer-specific context. Industry resources tend to be either too superficial or too specialized. Students eager to enter the field have lacked a comprehensive resource that bridges theory and practice.

This textbook aims to fill that gap.

Who This Book Is For

Professional Soccer Analytics and Visualization is designed primarily for undergraduate students with foundational knowledge in statistics and Python programming. However, the content has been structured to serve multiple audiences:

  • Undergraduate students in sports science, data science, statistics, or computer science programs seeking specialization in soccer analytics
  • Graduate students looking for a comprehensive reference and introduction to the field
  • Self-learners with technical backgrounds who want to transition into sports analytics
  • Current analysts seeking to formalize their knowledge or fill gaps in their education
  • Coaches and technical staff wanting to better understand and leverage analytical insights
  • Journalists and content creators aiming to incorporate advanced statistics into their work

What Makes This Book Different

Several principles guided the development of this textbook:

Theory Meets Practice

Every concept is grounded in both academic rigor and real-world application. We don't just explain what xG is—we build an xG model from scratch, evaluate its performance, discuss its limitations, and show how professional clubs actually use it.

Code First

This is not a book about soccer analytics—it's a book about doing soccer analytics. Every chapter includes substantial Python code that readers can run, modify, and extend. We believe you learn analytics by analyzing, not by reading about analyzing.

Building Complexity

The book is carefully sequenced to build knowledge incrementally. Early chapters establish foundations that later chapters extend. By the time you reach advanced topics like deep learning or pitch control models, you'll have all the prerequisite knowledge in place.

Real Data, Real Problems

Exercises and case studies use realistic data and scenarios drawn from professional contexts. You won't find contrived examples here—the problems you'll solve mirror the problems analysts solve in club offices every day.

Balanced Mathematical Treatment

We take a "medium intensity" approach to mathematics. Every formula is explained intuitively, with emphasis on building understanding rather than merely computing results. Derivations are provided for those who want them but clearly marked for those who don't.

How to Use This Book

For Self-Study

If you're learning independently, we recommend proceeding through the book sequentially, at least for Parts I and II. These foundational chapters establish concepts and vocabulary that later chapters assume.

For each chapter: 1. Read the main content carefully, running all code examples 2. Complete the exercises—learning requires active practice 3. Take the self-assessment quiz and aim for 70% before proceeding 4. Work through at least one case study to solidify understanding

Don't rush. The goal is mastery, not completion.

For Course Instructors

This textbook supports various course structures:

  • Single-semester introduction: Parts I-II (Chapters 1-14)
  • Two-semester sequence: Parts I-III (Chapters 1-21)
  • Advanced course: Parts III-IV with Part I as prerequisite
  • Capstone or project course: Part V with selected earlier material

Each chapter includes more exercises than any single course would use, allowing instructors to select problems matching their emphasis. Solutions to all exercises are available to verified instructors.

For Reference

The appendices provide quick-reference material for working analysts: - Appendix A reviews mathematical foundations - Appendix C provides Python patterns and idioms - Appendix E offers a comprehensive glossary - Appendix D catalogs data sources

Prerequisites

To get the most from this textbook, you should have:

Statistics: - Understanding of descriptive statistics (mean, median, standard deviation) - Familiarity with probability concepts (probability distributions, conditional probability) - Basic exposure to hypothesis testing and confidence intervals - Awareness of regression analysis concepts

Programming: - Comfortable writing Python functions and classes - Experience with basic data structures (lists, dictionaries) - Familiarity with reading documentation and debugging - Some exposure to pandas or similar data manipulation libraries

Soccer: - Understanding of basic soccer rules and gameplay - Familiarity with common tactical concepts (formations, positions) - General awareness of major leagues and competitions

If you're uncertain about your preparation, Chapter 3 (Statistical Foundations) and Chapter 4 (Python Programming) provide comprehensive reviews with soccer-specific examples.

Acknowledgments

A textbook of this scope reflects the contributions of countless individuals across the soccer analytics community. We are indebted to the researchers who have published their methods, the practitioners who have shared their experiences, and the open-source developers who have built the tools we all rely upon.

Special thanks to the StatsBomb team for their commitment to open data, which has democratized soccer analytics education. The mplsoccer and socceraction libraries, along with many others, have made sophisticated analysis accessible to students worldwide.

We are also grateful to the reviewers, instructors, and students who provided feedback on early drafts. Your insights improved every chapter.

A Final Word

Soccer analytics is a young field, still finding its footing between art and science. The best analysts combine rigorous quantitative skills with deep football knowledge and strong communication abilities. No textbook can fully develop all three dimensions—but we hope this one provides a solid foundation in the first while nurturing appreciation for the others.

We wrote this book because we believe in the power of data to illuminate the beautiful game, and because we want to help the next generation of analysts reach even further than we have. If you're holding this book, you're joining a community of curious, passionate people who share that vision.

Welcome. Let's get to work.


The Authors 2024