Further Reading: Rushing and Running Game Analysis
Foundational Resources
Academic Papers
-
"The Value of an Elite Running Back" - Football Outsiders Research - Analyzes replacement-level theory applied to RBs - Quantifies marginal value of rushing production - Discusses roster construction implications
-
"Yards After Contact as a Predictor of Future Performance" - Studies YAC stability year-over-year - Compares YAC to YPC predictive value - Establishes portable skill framework
-
"Expected Rushing Yards: A New Framework" - Sports Analytics Conference - Details methodology for expected rushing models - Feature selection for pre-snap predictions - Validation and calibration approaches
-
"Success Rate vs. EPA: Which Better Predicts Rushing Performance?" - Comparative analysis of efficiency metrics - Split-half reliability testing - Recommendations for different use cases
Books
-
"Football Analytics: Unlocking Performance" - Chapter on running game analysis - Blocking metrics and attribution - Team-level rushing evaluation
-
"The Hidden Game of Football" by Carroll, Palmer, and Thorn - Historical perspective on rushing value - Early efficiency metrics - Context-adjusted statistics
-
"Mathletics" by Wayne Winston - Sports analytics methodology - Statistical thinking applied to football - Expected value frameworks
Online Resources
Data Sources
-
cfbfastR (College Football) - R package for college football play-by-play - Pre-calculated EPA and success rate - Historical data back to 2003 - GitHub: github.com/sportsdataverse/cfbfastR
-
nflfastR (NFL Reference) - R package for NFL play-by-play - Expected yards models - Comprehensive rushing metrics - GitHub: github.com/nflverse/nflfastR
-
College Football Reference - Traditional rushing statistics - Player and team records - Historical comparisons
-
Pro Football Focus (PFF) - Yards before contact data - Blocking grades - Premium rushing metrics
-
Sports Info Solutions (SIS) - Broken tackle data - Missed tackle rates - Charting-based metrics
Tutorials and Guides
-
"Calculating Rush Yards Over Expected" - Open Source Football - Step-by-step RYOE implementation - Feature engineering examples - Model evaluation techniques - URL: opensourcefootball.com
-
"Understanding Success Rate" - Football Outsiders - Conceptual explanation - Historical development - Application examples
-
"YAC Analysis in Python" - Sports Analytics Tutorial - Python implementation - Visualization techniques - Comparative analysis code
-
"Building an Expected Rushing Model" - Kaggle Notebook - Full ML pipeline - Feature importance analysis - Model comparison
Tools and Libraries
Python Packages
# Core data manipulation
import pandas as pd
import numpy as np
# Statistical analysis
from scipy import stats
import statsmodels.api as sm
# Machine learning for expected models
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.ensemble import RandomForestRegressor, GradientBoostingRegressor
from sklearn.model_selection import cross_val_score
# Visualization
import matplotlib.pyplot as plt
import seaborn as sns
# Sports-specific
# pip install sportsdataverse # College football data
# pip install nfl-data-py # NFL data wrapper
R Packages
# College football data
install.packages("cfbfastR")
# NFL reference data
install.packages("nflfastR")
# Data manipulation
install.packages("tidyverse")
# Visualization
install.packages("ggplot2")
install.packages("gt") # Tables
Advanced Topics
Expected Rushing Models
-
Linear Models - Simple: Box count adjustment only - Intermediate: Gap, formation, down/distance - Paper: "A Linear Approach to Expected Rushing Yards"
-
Machine Learning Approaches - Random Forest for expected yards - XGBoost implementations - Neural network approaches - Paper: "Machine Learning for Pre-Snap Rushing Predictions"
-
Bayesian Methods - Hierarchical models for RB evaluation - Uncertainty quantification - Prior information incorporation
Blocking Analysis
-
Zone Blocking Metrics - Movement and displacement measurement - Zone vs. gap scheme efficiency - Resource: Football Outsiders Zone Blocking Analysis
-
Individual Lineman Grading - PFF methodology overview - Win rate calculations - Attribution challenges
-
Tracking Data Applications - Pre-snap alignment analysis - Block quality from player movement - Next Gen Stats rushing metrics
Situational Analysis Deep Dives
-
Short Yardage Analytics - Formation tendencies - Personnel grouping impact - Success rate by play type
-
Goal Line Rushing - TD rate by distance - Formation analysis - Personnel optimization
-
Late Game Rushing - Clock management value - Win probability impact - Efficiency changes with lead
Research Questions for Further Study
RB Evaluation
- How stable is RYOE year-over-year compared to YPC?
- What is the optimal workload for RB efficiency?
- How does age affect rushing metrics differently?
- Can we predict breakout seasons from advanced metrics?
Blocking and Attribution
- What percentage of rushing success is attributable to blocking vs. RB skill?
- How do zone vs. gap schemes affect RYOE?
- Can we isolate individual lineman impact on rushing?
- How does tight end blocking affect outside run success?
Scheme Analysis
- Which formations produce highest RYOE?
- How does pre-snap motion affect rushing success?
- What is the optimal run/pass balance for rushing efficiency?
- How do RPOs affect pure rushing metrics?
Situational Deep Dives
- Is short-yardage ability a persistent skill?
- How does game script affect RB efficiency metrics?
- What is the value of late-game rushing reliability?
- How do weather conditions affect rushing metrics?
Video Resources
YouTube Channels
- Brett Kollmann - Film breakdown including rushing analysis
- JT O'Sullivan - QB School (includes run game concepts)
- Baldy Breakdowns - Brian Baldinger's analysis
Courses and Lectures
- MIT Sloan Sports Analytics Conference - Rushing analytics presentations
- Coursera Sports Analytics - General methodology
- StatsBomb Conference - Soccer parallels (expected goals → expected yards)
Community and Forums
Online Communities
- r/NFLstatheads - Reddit community for football analytics
- r/CFBAnalysis - College football analytics discussion
- Football Outsiders Forums - FO methodology discussion
- Fantasy Football Analytics - Applied rushing metrics
Twitter/X Accounts to Follow
- @benbbaldwin - nflfastR creator
- @thomasmock - Sports data visualization
- @PFF - Pro Football Focus
- @SethWalder - ESPN analytics
- @CamPen66 - Expected rushing models
Conferences
- MIT Sloan Sports Analytics Conference - Annual, March
- SABR Analytics Conference - Baseball-focused but relevant methods
- Carnegie Mellon Sports Analytics Conference - Academic focus
- Great Lakes Analytics Conference - Regional, applied focus
Practice Datasets
Public Datasets
-
cfbfastR Play-by-Play - 2000-present college football - Includes EPA, success rate - Free access via R/Python
-
NFL Play-by-Play (nflfastR) - 1999-present NFL data - Pre-calculated metrics - Ideal for model training
-
Kaggle NFL Big Data Bowl - Tracking data samples - Annual competitions - Creative feature engineering
Sample Projects
-
Build an RYOE Model - Start with box count only - Add features incrementally - Compare to published models
-
RB Consistency Study - Calculate success rate by week - Analyze variance patterns - Identify reliable backs
-
Blocking Attribution Analysis - Compare YBC across teams for same RB - Study backs who changed teams - Quantify line contribution
Citation Format
When referencing rushing analytics in academic or professional work:
Traditional metrics:
- Yards per carry (YPC)
- Rushing yards per game
Advanced metrics:
- Success rate (Football Outsiders, 2003)
- Yards after contact (Pro Football Focus)
- Rush Yards Over Expected (RYOE) - Baldwin et al.
- Expected Points Added (EPA) - Burke, 2014
Recommended Learning Path
Beginner (Weeks 1-4)
- Read Football Outsiders success rate methodology
- Calculate basic rushing stats in Python
- Understand YPC limitations
- Practice with cfbfastR data
Intermediate (Weeks 5-8)
- Implement YAC calculations
- Build simple expected rushing model
- Study blocking metrics
- Analyze situational splits
Advanced (Weeks 9-12)
- Build ML-based RYOE model
- Create attribution framework
- Develop composite evaluation system
- Publish analysis on personal blog
Expert (Ongoing)
- Incorporate tracking data
- Build real-time systems
- Develop novel metrics
- Contribute to open-source projects