Further Reading: Descriptive Statistics in Football
Foundational Statistics Textbooks
For Beginners
-
"Statistics" by David Freedman, Robert Pisani, and Roger Purves - Classic introduction with intuitive explanations - Strong emphasis on understanding concepts before formulas - Excellent for building statistical intuition - ISBN: 978-0393929720
-
"Naked Statistics" by Charles Wheelan - Accessible, entertaining introduction to statistics - Real-world examples throughout - Great for understanding why statistics matter - ISBN: 978-0393347777
-
"The Art of Statistics" by David Spiegelhalter - Modern approach to statistical thinking - Focuses on interpretation and communication - Excellent data visualization examples - ISBN: 978-1541618510
For Deeper Understanding
-
"All of Statistics" by Larry Wasserman - Comprehensive coverage of statistical methods - More mathematical rigor - Good reference for advanced techniques - ISBN: 978-0387402727
-
"Introduction to Statistical Learning" by James, Witten, Hastie, Tibshirani - Bridges descriptive and predictive statistics - Excellent R and Python applications - Free PDF available: https://www.statlearning.com - ISBN: 978-1071614174
Sports Analytics Literature
Books
-
"Mathletics" by Wayne Winston - Comprehensive sports analytics textbook - Covers multiple sports including football - Strong quantitative foundation - ISBN: 978-0691177625
-
"Analyzing Baseball Data with R" by Marchi and Albert - While baseball-focused, teaches transferable skills - Excellent examples of descriptive statistics in sports - Strong R programming instruction - ISBN: 978-0815353515
-
"Football Analytics with Python & R" by Eric Eager and Richard Erickson - Directly applicable to NFL analytics - Modern analytical techniques - Comprehensive code examples - ISBN: 978-1492099611
Academic Papers
-
"Expected Points and EPA Explained" - nflfastR Documentation - Foundation for understanding modern football metrics - https://www.nflfastr.com/articles/beginners_guide.html
-
"A New Way to Measure Clutch" by Bill Barnwell - Application of z-scores to performance evaluation - ESPN Analytics methodology
-
"Consistency and the NFL" by Chase Stuart - Analysis of game-to-game variation - Football Perspective archives
Online Resources
Websites and Blogs
-
Football Outsiders (https://www.footballoutsiders.com) - Pioneer in advanced football statistics - DVOA methodology explained - Regular statistical analysis articles
-
ESPN Stats & Information (https://www.espn.com/blog/statsinfo) - Professional sports statistics - QBR and other proprietary metrics explained
-
Pro Football Reference (https://www.pro-football-reference.com) - Comprehensive statistical database - Glossary of statistics - Historical comparisons
-
College Football Reference (https://www.sports-reference.com/cfb/) - Complete college football statistics - School and player histories - Advanced metrics
-
Bill Connelly's Work (ESPN/SB Nation archives) - SP+ rating system - Five Factors of football - Extensive college football analysis
Python/Pandas Resources
-
Pandas Documentation (https://pandas.pydata.org/docs/) - Official documentation - Descriptive statistics methods:
.describe(),.corr(),.agg()- GroupBy operations -
SciPy Stats Tutorial (https://docs.scipy.org/doc/scipy/tutorial/stats.html) - Statistical functions in Python - Distribution fitting - Hypothesis testing
-
Real Python Statistics Tutorials (https://realpython.com/python-statistics/) - Practical Python statistics guides - Step-by-step examples - Best practices
Data Sources
Free College Football Data
-
CollegeFootballData.com API (https://collegefootballdata.com) - Comprehensive play-by-play data - Team and player statistics - Free API with reasonable limits - Python wrapper:
cfbdpackage -
cfbfastR (https://cfbfastr.sportsdataverse.org/) - R package for college football data - Based on ESPN and CFBD data - Python equivalent:
cfbfastR-py -
Sports Reference Data (https://www.sports-reference.com) - Historical statistics - CSV export available - Extensive school-by-school data
Commercial/Professional Sources
-
Pro Football Focus (PFF) - Detailed player grades - Play-by-play charting - Subscription required for full access
-
Sports Info Solutions (SIS) - Advanced tracking data - Professional analytics services
-
Second Spectrum / NFL Next Gen Stats - Player tracking data - Professional use primarily
Video Resources
Courses
-
Khan Academy Statistics (https://www.khanacademy.org/math/statistics-probability) - Free, comprehensive statistics course - Interactive exercises - Video explanations
-
Coursera: Statistics with Python (University of Michigan) - Applied statistics using Python - pandas and scipy focus - Certificate available
-
DataCamp: Statistical Thinking (https://www.datacamp.com) - Interactive Python-based learning - Sports analytics tracks available - Subscription-based
YouTube Channels
-
StatQuest with Josh Starmer - Clear explanations of statistical concepts - Visual approach to learning - https://www.youtube.com/user/joshstarmer
-
3Blue1Brown - Mathematical intuition - Excellent visualizations - https://www.youtube.com/c/3blue1brown
-
Sports Analytics Research - MIT Sloan Sports Analytics Conference talks - Available on YouTube - Professional presentations
Practice Datasets
Recommended Datasets for Practice
-
NFL Play-by-Play (nflfastR) - Complete NFL play data - EPA and WPA calculated - https://github.com/nflverse/nflverse-data
-
CFB Play-by-Play (cfbfastR) - College football play data - 2014-present seasons - https://github.com/sportsdataverse/cfbfastR-data
-
Kaggle NFL Datasets - Various competition datasets - Tracking data available - https://www.kaggle.com/datasets?search=nfl
-
FiveThirtyEight Sports Data - Curated sports datasets - Methodology documentation - https://github.com/fivethirtyeight/data
Conferences and Community
Academic/Professional Conferences
-
MIT Sloan Sports Analytics Conference - Premier sports analytics conference - Research paper competitions - https://www.sloansportsconference.com
-
SABR Analytics Conference - Society for American Baseball Research - Increasingly multi-sport - Statistical methodology focus
-
useR! and PyCon - R and Python conferences - Sports analytics tracks - Open source community
Online Communities
-
r/CFBAnalysis (Reddit) - College football analytics discussion - Data sharing - Project collaboration
-
Sports Analytics Twitter/X - Key accounts: @benbbaldwin, @PFF_College, @ESPN_Analytics - Real-time discussion - New research sharing
-
Discord Communities - nflverse Discord for NFL data - sportsdataverse for college sports - Active developer communities
Suggested Learning Path
Week 1-2: Statistics Foundations
- Read: "Naked Statistics" Chapters 1-5
- Practice: Khan Academy descriptive statistics
- Apply: Calculate team statistics with pandas
Week 3-4: Sports Applications
- Read: Football Outsiders methodology articles
- Practice: Analyze one season of team data
- Apply: Build correlation analysis of stats vs. wins
Week 5-6: Advanced Techniques
- Read: "Mathletics" football chapters
- Practice: Z-score player comparisons
- Apply: Create player consistency analysis
Week 7-8: Integration
- Read: Bill Connelly's SP+ explanation
- Practice: Reproduce a published analysis
- Apply: Present findings with visualizations
Citation Format
When using statistical concepts in your own work, consider citing:
For foundational statistics:
Freedman, D., Pisani, R., & Purves, R. (2007). Statistics (4th ed.).
W. W. Norton & Company.
For sports analytics methodology:
Winston, W. L. (2012). Mathletics: How Gamblers, Managers, and Sports
Enthusiasts Use Mathematics in Baseball, Basketball, and Football.
Princeton University Press.
For college football data:
CollegeFootballData.com. (Year). [Dataset name]. Retrieved from
https://collegefootballdata.com