Chapter 18: Further Reading - Game Outcome Prediction
Academic Papers
Rating Systems
-
"Elo Ratings: A System's Perspective" - Arpad Elo (1978) - Original Elo rating system paper - Mathematical foundations - Chess rating implementation
-
"A Bradley-Terry Type Model for Forecasting Tennis Match Results" - Klaassen & Magnus - Extensions of paired comparison models - Application to sports prediction - Statistical inference methods
-
"The Power of Elo: Applications to Sports Analytics" - Hvattum & Arntzen - Elo adaptations for team sports - Soccer and football applications - Comparison with other methods
-
"Margin of Victory and NFL Game Predictions" - Journal of Quantitative Analysis in Sports - Incorporating margin into ratings - Optimal K-factor selection - Home field advantage estimation
Machine Learning in Sports
-
"Machine Learning Methods for Predicting NFL Game Outcomes" - Comparison of ML algorithms - Feature engineering approaches - Cross-validation strategies
-
"Deep Learning for Sports Prediction" - Recent arXiv preprints - Neural network architectures - Sequence modeling for sports - State-of-the-art methods
-
"Ensemble Methods for Sports Forecasting" - Combining multiple models - Weight optimization - Stacking approaches
Books
Statistical Prediction
-
"Statistical Sports Models in Excel" - Andrew Mack - Practical spreadsheet implementations - Step-by-step tutorials - Football-specific examples
-
"The Signal and the Noise" - Nate Silver - Prediction philosophy - Sports prediction chapter - FiveThirtyEight methodology insights
-
"Superforecasting" - Philip Tetlock - Probability calibration - Expert prediction improvement - Uncertainty quantification
Machine Learning
-
"Hands-On Machine Learning with Scikit-Learn" - Aurélien Géron - Python ML implementation - Model evaluation techniques - Production deployment
-
"Feature Engineering and Selection" - Max Kuhn & Kjell Johnson - Comprehensive feature engineering - Selection methods - Sports analytics examples
-
"Probabilistic Machine Learning" - Kevin Murphy - Probabilistic prediction - Bayesian approaches - Uncertainty quantification
Online Resources
Rating Systems
-
FiveThirtyEight NFL Elo Methodology - https://fivethirtyeight.com/methodology/how-our-nfl-predictions-work/ - Detailed Elo implementation - Home field and rest adjustments
-
ESPN FPI Methodology - ESPN's Football Power Index explanation - Efficiency-based ratings - Preseason initialization
-
SP+ Methodology (Bill Connelly) - Football-specific efficiency ratings - Play-by-play adjustments - Historical analysis
Data Sources
-
nflfastR / cfbfastR - R packages for NFL/CFB data - Play-by-play statistics - https://www.nflfastr.com/
-
College Football Data API - Free CFB data access - Historical game results - https://collegefootballdata.com/
-
Sports Reference - Comprehensive historical data - Team and player statistics - https://www.sports-reference.com/cfb/
Tools and Libraries
Python Prediction Stack
-
scikit-learn - https://scikit-learn.org - Core ML library - Model selection and evaluation - Cross-validation tools
-
XGBoost - https://xgboost.readthedocs.io - Gradient boosting implementation - High performance - Feature importance
-
LightGBM - https://lightgbm.readthedocs.io - Fast gradient boosting - Categorical feature handling - Large dataset support
-
CatBoost - https://catboost.ai - Handles categorical data natively - Good default parameters - Robust to overfitting
Specialized Tools
-
elopy - Python Elo implementation - Easy rating system setup - Custom K-factor support
-
sportsipy - Sports reference scraping - Automated data collection - Multiple sports support
-
betting-models - Betting analytics - Line comparison tools - ROI calculations
Industry Blogs and Sites
Analytics Sites
-
Football Outsiders - https://www.footballoutsiders.com/ - DVOA methodology - Advanced statistics - Game previews
-
The Athletic (Analytics Coverage) - In-depth analysis - Model explanations - Industry insights
-
FiveThirtyEight Sports - Prediction methodology - Model performance tracking - Interactive tools
Technical Blogs
-
Towards Data Science - Sports Analytics - Tutorial articles - Code examples - https://towardsdatascience.com/
-
Pinnacle Betting Resources - Market efficiency analysis - Betting mathematics - Model evaluation
-
SBNation (Football Study Hall) - Bill Connelly's work - SP+ explanations - Advanced stats
Competitions and Practice
Kaggle Competitions
-
NFL Big Data Bowl - Annual competition - Tracking data - Novel analytics - Real NFL evaluation
-
March Machine Learning Mania - NCAA Tournament - Game prediction - Probability calibration - Public leaderboard
-
NFL 1st and Future - Player safety - Prediction challenges - Industry prizes
Practice Datasets
-
Historical NFL/CFB Results - Available on Kaggle - Multiple decades - Clean format
-
nflfastR Data Repository - Play-by-play data - Regular updates - Documentation
Podcasts and Videos
Podcasts
- "Bet The Process" - Sports betting analytics
- "PFF NFL Podcast" - Advanced statistics
- "The Analytics Edge" - Sports prediction
- "Thinking Basketball" - Analytics philosophy (transferable)
Video Courses
-
"Sports Analytics" - Coursera - University of Michigan - Prediction fundamentals - Python implementation
-
"Machine Learning A-Z" - Udemy - Comprehensive ML - Sports examples - Practical focus
Research Groups
Academic
-
CMU Statistics in Sports - Carnegie Mellon research - Published papers - Student projects
-
Stanford Sports Analytics - Research initiatives - Industry connections - Technical papers
-
MIT Sloan Sports Analytics Conference - Annual conference - Research papers - Industry presentations
Industry
-
ESPN Analytics - FPI development - QBR methodology - Win probability
-
Pro Football Focus (PFF) - Grading systems - Expected points - Player projections
Suggested Learning Path
Week 1-2: Rating Systems
- Implement basic Elo from scratch
- Add home field advantage
- Study FiveThirtyEight methodology
Week 3-4: Feature Engineering
- Create team strength features
- Build matchup differentials
- Add situational features
Week 5-6: Model Building
- Train multiple model types
- Implement cross-validation
- Build ensemble
Week 7-8: Evaluation and Calibration
- Calculate all metrics
- Plot calibration curves
- Compare to baselines
Week 9-10: Production Systems
- Build prediction pipeline
- Generate weekly predictions
- Track performance over time
Key Papers by Topic
Elo and Ratings
- Elo, A. (1978). The Rating of Chessplayers
- Glickman, M. (1999). Parameter estimation in large dynamic paired comparison experiments
Sports Prediction
- Boulier, B. & Stekler, H. (1999). Are sports seedings good predictors?
- Song, C. et al. (2007). Limits of predictability in human mobility
Calibration
- Gneiting, T. & Raftery, A. (2007). Strictly proper scoring rules
- Niculescu-Mizil, A. & Caruana, R. (2005). Predicting good probabilities
Ensemble Methods
- Dietterich, T. (2000). Ensemble methods in machine learning
- Caruana, R. et al. (2004). Ensemble selection from libraries of models
Citation Format
APA Format:
Author, A. A. (Year). Title of work. Publisher/Journal.
Example:
Silver, N. (2012). The Signal and the Noise: Why So Many
Predictions Fail—but Some Don't. Penguin Press.