Chapter 26: Machine Learning in Basketball - Further Reading
Foundational Machine Learning Texts
Introductory
James, G., Witten, D., Hastie, T., & Tibshirani, R. (2021). An Introduction to Statistical Learning (2nd ed.). Springer. The essential introduction to statistical learning methods. Free at www.statlearning.com. Covers regression, classification, resampling, tree methods, and unsupervised learning with R examples.
Geron, A. (2022). Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow (3rd ed.). O'Reilly Media. Practical implementation guide covering the complete ML workflow. Excellent for building working systems.
Muller, A. C., & Guido, S. (2016). Introduction to Machine Learning with Python. O'Reilly Media. Python-focused introduction using scikit-learn. Good for those new to ML implementation.
Advanced
Hastie, T., Tibshirani, R., & Friedman, J. (2009). The Elements of Statistical Learning (2nd ed.). Springer. Comprehensive mathematical treatment of machine learning. Free online. More rigorous than ISLR.
Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer. Classic ML text with Bayesian perspective. Mathematically rigorous.
Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. MIT Press. The definitive deep learning textbook. Free at www.deeplearningbook.org.
Sports Analytics Applications
Basketball-Specific
Goldsberry, K. (2019). Sprawlball: A Visual Tour of the New Era of the NBA. Mariner Books. Visual analytics approach to modern basketball. Demonstrates effective data visualization.
Oliver, D. (2004). Basketball on Paper. Potomac Books. Foundation for basketball analytics. Essential context for ML applications.
Kubatko, J., Oliver, D., Pelton, K., & Rosenbaum, D. T. (2007). "A Starting Point for Analyzing Basketball Statistics." Journal of Quantitative Analysis in Sports, 3(3). Academic foundation for basketball metrics.
Cross-Sport ML Applications
Albert, J., Glickman, M. E., Swartz, T. B., & Koning, R. H. (Eds.). (2017). Handbook of Statistical Methods and Analyses in Sports. Chapman and Hall/CRC. Comprehensive handbook covering ML applications across sports.
Lewis, M. (2003). Moneyball: The Art of Winning an Unfair Game. W. W. Norton. Classic on data-driven sports decisions. While baseball-focused, principles apply broadly.
Research Papers
Player Evaluation
Cervone, D., D'Amour, A., Bornn, L., & Goldsberry, K. (2016). "A Multiresolution Stochastic Process Model for Predicting Basketball Possession Outcomes." Journal of the American Statistical Association, 111(514), 585-599. Expected Possession Value (EPV) model. Cutting-edge ML application to basketball.
Engelmann, J. (2017). "Possession-Based Player Performance Analysis in Basketball." MIT Sloan Sports Analytics Conference. Academic treatment of player impact metrics.
Clustering and Classification
Wang, K. C., & Zemel, R. (2016). "Classifying NBA Offensive Plays Using Neural Networks." MIT Sloan Sports Analytics Conference. Neural network application to play classification.
Metulini, R., Manisera, M., & Zuccolotto, P. (2018). "Modelling the Dynamic Pattern of Surface Area in Basketball and its Effects on Team Performance." Journal of Quantitative Analysis in Sports, 14(2), 61-75. Tracking data analysis with ML methods.
Deep Learning Applications
Yue, Y., Lucey, P., Carr, P., Bialkowski, A., & Matthews, I. (2014). "Learning Fine-Grained Spatial Models for Dynamic Sports Play Prediction." IEEE International Conference on Data Mining. Deep learning for spatial pattern recognition in sports.
Online Courses
Machine Learning Foundations
Andrew Ng's Machine Learning (Coursera) - https://www.coursera.org/learn/machine-learning - Classic introduction to ML concepts
Fast.ai Practical Deep Learning - https://www.fast.ai/ - Practical, code-first approach to deep learning
Google Machine Learning Crash Course - https://developers.google.com/machine-learning/crash-course - Free, quick introduction with TensorFlow
Specialized Topics
Kaggle Learn - https://www.kaggle.com/learn - Micro-courses on specific ML topics
DataCamp - https://www.datacamp.com/ - Interactive courses including sports analytics tracks
Tools and Libraries
Core Libraries
scikit-learn - https://scikit-learn.org/ - Essential Python ML library - Comprehensive documentation and tutorials
XGBoost - https://xgboost.readthedocs.io/ - Gradient boosting implementation - Often best performance for tabular data
LightGBM - https://lightgbm.readthedocs.io/ - Fast gradient boosting from Microsoft - Good for large datasets
Deep Learning
TensorFlow - https://www.tensorflow.org/ - Google's deep learning framework - Good production support
PyTorch - https://pytorch.org/ - Facebook's deep learning framework - Preferred for research
Interpretation
SHAP - https://github.com/slundberg/shap - Model interpretation library - Essential for explaining predictions
LIME - https://github.com/marcotcr/lime - Local interpretable explanations
Data Sources
NBA Statistics
Basketball-Reference - https://www.basketball-reference.com/ - Comprehensive historical statistics
NBA Stats API - https://www.nba.com/stats/ - Official NBA statistics
nba_api Python Package - https://github.com/swar/nba_api - Python access to NBA.com data
Tracking Data
NBA.com/stats Advanced - Player tracking metrics - Shot location data
Second Spectrum - https://www.secondspectrum.com/ - Official NBA tracking provider - Limited public access
Kaggle Datasets
NBA Players Stats - Various datasets on Kaggle - Search: "NBA statistics kaggle"
Blogs and Websites
Technical ML
Towards Data Science - https://towardsdatascience.com/ - ML tutorials and applications
Machine Learning Mastery - https://machinelearningmastery.com/ - Practical tutorials by Jason Brownlee
Sports Analytics
Nylon Calculus - https://fansided.com/nba/nylon-calculus/ - Basketball analytics writing
Cleaning the Glass - https://cleaningtheglass.com/ - Advanced basketball analysis
FiveThirtyEight - https://fivethirtyeight.com/sports/nba/ - Statistical sports journalism
Recommended Reading Path
For ML Beginners
- ISLR (James et al.) - Chapters 1-6
- Andrew Ng's Coursera course
- Geron's Hands-On ML - Part I
- Practice on Kaggle competitions
For Basketball Analysts New to ML
- Oliver - Basketball on Paper
- ISLR - Chapters 1-4
- scikit-learn tutorials
- Apply to basketball datasets
For Experienced ML Practitioners
- ESL (Hastie et al.) - selected chapters
- Academic papers (Cervone, etc.)
- Advanced techniques (deep learning, tracking data)
- Original research projects
Conference Resources
MIT Sloan Sports Analytics Conference
- https://www.sloansportsconference.com/
- Annual conference with research papers
- Archives available online
CVPR/NeurIPS Sports Workshops
- Computer vision and deep learning conferences
- Increasingly include sports analytics tracks
KDD Sports Analytics Workshop
- https://www.kdd.org/
- Data mining applications in sports