Quiz: Statistical Foundations for Football Analytics
Target: 70% or higher to proceed.
Section 1: Multiple Choice (1 point each)
1. What does EPA measure?
- A) Total points scored on a play
- B) The change in expected points before and after a play
- C) The probability of scoring on the next play
- D) The point value of field position
Answer
**B)** The change in expected points before and after a play *Explanation:* EPA = EP_after - EP_before, measuring how much a play changed the team's expected point outcome.2. If a team has 60% win probability at halftime, what does this mean?
- A) They will definitely win
- B) Based on historical data, teams in similar situations won 60% of the time
- C) They have scored 60% of the points
- D) They are 60% better than the opponent
Answer
**B)** Based on historical data, teams in similar situations won 60% of the time *Explanation:* Win probability is based on historical outcomes of games with similar game states (score, time, possession, etc.).3. What is the relationship between statistical significance and practical significance?
- A) They are the same thing
- B) Statistical significance guarantees practical significance
- C) A result can be statistically significant but not practically significant
- D) Practical significance is not measurable
Answer
**C)** A result can be statistically significant but not practically significant *Explanation:* With large samples, tiny differences can be statistically significant. Effect size measures practical importance.4. What is Cohen's d?
- A) A measure of statistical significance
- B) A standardized effect size measure
- C) A type of regression coefficient
- D) A probability value
Answer
**B)** A standardized effect size measure *Explanation:* Cohen's d measures the difference between means in standard deviation units, allowing comparison across studies.5. Why is the average EPA for passes higher than for runs?
- A) Passes are counted more
- B) Passes have higher ceiling and successful passes gain more yards/EPA
- C) NFL rules favor passing
- D) Run plays are always negative
Answer
**B)** Passes have higher ceiling and successful passes gain more yards/EPA *Explanation:* While passes are riskier, successful passes tend to gain more yards and create more value than runs.6. What does a p-value of 0.03 mean?
- A) There's a 3% chance the null hypothesis is true
- B) If the null hypothesis is true, there's a 3% chance of seeing results this extreme
- C) The effect size is 3%
- D) The result is 97% reliable
Answer
**B)** If the null hypothesis is true, there's a 3% chance of seeing results this extreme *Explanation:* P-values represent the probability of observing the data (or more extreme) assuming the null hypothesis is true.7. What problem does the Bonferroni correction address?
- A) Sample size issues
- B) Inflation of Type I error when making multiple comparisons
- C) Non-normal data
- D) Heteroscedasticity
Answer
**B)** Inflation of Type I error when making multiple comparisons *Explanation:* When running many tests, some will appear significant by chance. Bonferroni adjusts the significance threshold.8. In logistic regression, what do the coefficients represent?
- A) Change in the outcome variable
- B) Change in log-odds per unit change in predictor
- C) Probability of success
- D) Correlation with the outcome
Answer
**B)** Change in log-odds per unit change in predictor *Explanation:* Logistic regression models log-odds. Exponentiating coefficients gives odds ratios.9. What is regression to the mean?
- A) All values eventually become average
- B) Extreme values tend to be followed by less extreme values
- C) Regression coefficients equal the mean
- D) Mean values don't change over time
Answer
**B)** Extreme values tend to be followed by less extreme values *Explanation:* Due to random variation, extreme performances include some luck and tend to regress toward average.10. What is survivorship bias?
- A) Only analyzing successful outcomes, missing failures
- B) Studying only players who survived injuries
- C) Teams that survive to playoffs are better
- D) Statistical bias in survival analysis
Answer
**A)** Only analyzing successful outcomes, missing failures *Explanation:* If we only study successful draft picks (starters), we miss busts and overestimate success rates.11. What does Win Probability Added (WPA) measure?
- A) Probability of winning
- B) How much a play changed the team's win probability
- C) Average wins added per player
- D) Expected wins for a team
Answer
**B)** How much a play changed the team's win probability *Explanation:* WPA = WP_after - WP_before, measuring each play's contribution to win probability.12. When comparing two proportions (like catch rates), which test is appropriate?
- A) t-test
- B) ANOVA
- C) z-test for proportions
- D) Chi-square test
Answer
**C)** z-test for proportions *Explanation:* When comparing proportions (binary outcomes), the z-test for proportions is appropriate.Section 2: True/False (1 point each)
13. A 95% confidence interval means there's a 95% probability the true parameter is in that interval.
Answer
**False** *Explanation:* The true parameter is fixed. A 95% CI means if we repeated the experiment many times, 95% of calculated intervals would contain the true value.14. With a large enough sample size, any difference will be statistically significant.
Answer
**True** *Explanation:* As n increases, standard error decreases, making even tiny differences detectable. This is why effect size matters.15. EPA can be negative for a play.
Answer
**True** *Explanation:* Plays that reduce expected points (turnovers, sacks, failed third downs) have negative EPA.16. Correlation between pass rate and winning proves that passing more causes wins.
Answer
**False** *Explanation:* Correlation ≠ causation. Winning teams often pass more because they have leads, not because passing causes winning.17. Ridge regression is preferred when you want to select a subset of important features.
Answer
**False** *Explanation:* Lasso regression sets coefficients to zero, performing feature selection. Ridge shrinks but doesn't zero out.18. Bayes' theorem allows us to update probabilities as new evidence arrives.
Answer
**True** *Explanation:* Bayesian updating combines prior beliefs with new evidence to form posterior probabilities.Section 3: Calculation (2 points each)
19. A kicker makes 42 of 50 field goal attempts. Calculate: - Success rate - 95% confidence interval for true success rate
Answer
**Success rate:** 42/50 = 84% **95% CI:** - p = 0.84, n = 50 - SE = √(0.84 × 0.16 / 50) = 0.0518 - z = 1.96 for 95% - CI = 0.84 ± 1.96 × 0.0518 = (0.738, 0.942) The 95% confidence interval is approximately **73.8% to 94.2%**.20. QB A has 0.15 EPA/play on 300 passes. QB B has 0.10 EPA/play on 350 passes. Assume pooled standard deviation is 1.2. Calculate Cohen's d.
Answer
**Cohen's d = (Mean₁ - Mean₂) / Pooled SD** d = (0.15 - 0.10) / 1.2 = 0.05 / 1.2 = **0.042** This is a **negligible** effect size (< 0.2). Even if statistically significant due to large n, the practical difference is minimal.21. A team wins 70% of games when leading at halftime. They lead at halftime in 55% of games. Calculate P(Lead at Half | Win).
Answer
Using Bayes' theorem: - P(Win | Lead) = 0.70 - P(Lead) = 0.55 - P(Win) = P(Win|Lead)×P(Lead) + P(Win|Trail)×P(Trail) Assume P(Win | Trail) = 0.30 (complement roughly) - P(Win) = 0.70 × 0.55 + 0.30 × 0.45 = 0.385 + 0.135 = 0.52 P(Lead | Win) = P(Win|Lead) × P(Lead) / P(Win) P(Lead | Win) = 0.70 × 0.55 / 0.52 = 0.385 / 0.52 = **0.74 (74%)**Section 4: Short Answer (2 points each)
22. Explain why small sample sizes are problematic for evaluating player performance in football.
Sample Answer
Small samples are problematic because: 1. **High variance**: A player's observed performance may differ significantly from true ability due to random variation 2. **Wide confidence intervals**: With few plays, uncertainty around estimates is large, making comparisons unreliable 3. **Regression to the mean**: Extreme performances in small samples often contain luck and won't persist 4. **Underpowered tests**: Statistical tests lack power to detect real differences with small n **Example**: A RB with 5.5 YPC on 20 carries has a 95% CI of roughly 3.5-7.5 YPC—too wide for meaningful conclusions.23. What is selection bias and give a football example.
Sample Answer
**Selection bias** occurs when the sample analyzed is not representative of the population of interest, leading to biased conclusions. **Football Example: Fourth Down Analysis** When analyzing fourth-down conversion rates, we only observe plays where teams *chose* to go for it. Teams typically go for it when: - They're confident in success (good matchup) - Situation is desperate - Distance is very short This means the observed conversion rate (around 50-60%) doesn't represent what would happen if teams went for it *randomly*. The selection of when to attempt is biased toward favorable situations. **Consequence**: We can't simply say "teams should go for it more because conversion rate is 55%"—the rate would likely drop if teams went for it in situations they currently punt.24. Describe the difference between EPA and WPA and when each is more useful.
Sample Answer
**EPA (Expected Points Added)**: - Measures change in expected points - Context-neutral (same EPA for a TD in Week 1 vs Super Bowl) - Better for evaluating pure player skill and efficiency - Use for: player evaluation, team efficiency, play-calling analysis **WPA (Win Probability Added)**: - Measures change in win probability - Context-dependent (a TD in a close game adds more WPA than in a blowout) - Captures "clutch" value and situational importance - Use for: game narratives, identifying crucial plays, clutch performance **Key difference**: A long TD in garbage time has high EPA but low WPA. A short third-down conversion in a close playoff game has low EPA but potentially high WPA. EPA measures efficiency; WPA measures impact on winning.Section 5: Application (3 points each)
25. You find that a QB has 0.25 EPA/play vs 0.18 league average with p = 0.04. The sample is 250 plays. What additional analyses would you perform before concluding this QB is significantly better?
Sample Answer
Additional analyses needed: 1. **Effect size calculation**: Cohen's d to determine if the difference is practically meaningful, not just statistically significant 2. **Confidence interval**: Calculate 95% CI for the QB's EPA to understand uncertainty range 3. **Context adjustment**: - Check if performance differs home/away - Examine opponent strength - Control for supporting cast (receivers, O-line) 4. **Situation breakdown**: - EPA by down and distance - EPA in clean pocket vs pressure - Red zone vs rest of field 5. **Time series check**: Is performance consistent across weeks or driven by outlier games? 6. **Comparison group**: How does the QB rank among starters, not vs all QBs? 7. **Regression analysis**: Control for confounding variables (play-action, receiver separation, etc.) P = 0.04 alone is insufficient—practical significance and robustness checks are essential.26. Design a study to test whether play-action passes are more efficient than non-play-action passes. Address potential confounders.
Sample Answer
**Study Design:** **Hypothesis**: Play-action passes have higher EPA than non-play-action passes **Data**: Filter PBP to pass plays, split by play_action flag **Primary analysis**: Compare mean EPA between groups using two-sample t-test **Potential Confounders**: 1. **Down and distance**: Play-action used more on early downs - Control: Stratify by down or include in regression 2. **Score differential**: Play-action less common when trailing - Control: Include score differential as covariate 3. **Personnel**: Play-action requires different formations - Control: Analyze within similar personnel groupings 4. **Team effects**: Some teams use more play-action - Control: Include team random effects or fixed effects **Regression model**:EPA ~ play_action + down + ydstogo + score_diff + (1|team)
**Robustness checks**:
- Subset to similar situations (early downs, tied games)
- Propensity score matching
- Separate analysis for each team
**Causal limitation**: Teams choose when to use play-action, creating selection bias. True experiment impossible.
Section 6: Matching (1 point each)
Match the statistical concept with its definition:
| Concept | Definition |
|---|---|
| 27a. Type I Error | A. Using sample data to make conclusions about a population |
| 27b. Type II Error | B. Failing to reject a false null hypothesis |
| 27c. Statistical Inference | C. Rejecting a true null hypothesis |
| 27d. Power | D. Probability of correctly rejecting a false null hypothesis |
Answers
**27a. C** - Type I Error: Rejecting a true null hypothesis (false positive) **27b. B** - Type II Error: Failing to reject a false null hypothesis (false negative) **27c. A** - Statistical Inference: Using sample data to make conclusions about a population **27d. D** - Power: Probability of correctly rejecting a false null hypothesisSection 7: True Understanding (2 points each)
28. A analyst claims: "This QB's EPA of 0.22 is in the top 5, proving he's elite." What questions would you ask?
Sample Answer
Questions to ask: 1. **Sample size**: How many plays? CI width? 2. **Statistical significance**: Is 0.22 significantly above average? 3. **Time period**: Full season or cherry-picked stretch? 4. **Supporting cast**: Adjusted for receiver/O-line quality? 5. **Opponents**: Quality of defenses faced? 6. **Situation mix**: Easy situations (3rd and short) vs tough? 7. **Consistency**: Steady performance or outlier games? 8. **Sustainability**: Historical EPA or one-season spike?29. Why might observing that "teams that pass more tend to win" not mean "teams should pass more"?
Sample Answer
This is a classic case of **reverse causation** and **confounding**: 1. **Reverse causation**: Teams that are *winning* tend to pass more - When trailing, teams pass to catch up - When leading, teams can pass comfortably - The relationship is: winning → passing, not passing → winning 2. **Situational confounding**: - Better teams both pass more AND win more - The correlation is due to team quality, not pass rate causing wins 3. **Game script effects**: - Trailing teams face prevent defense (easier passing) - Leading teams face stacked boxes (easier passing) 4. **Selection on success**: - Successful passes (completions, big gains) are counted - Failed passes (incompletions, sacks) lead to running - We observe pass rate conditional on passing working **Bottom line**: Correlation between pass rate and winning is largely spurious—caused by game situation, not a causal effect of passing more.30. What's wrong with saying "Player X had a 90% catch rate, better than Player Y's 85%, so X is the better receiver"?
Sample Answer
Problems with this comparison: 1. **Sample size**: How many targets? 9/10 vs 85/100 is very different - Small samples have high variance 2. **Target quality**: - X might get easy short targets - Y might run deep routes with harder catches 3. **Context differences**: - X might be a slot receiver (shorter routes) - Y might be a perimeter receiver (contested catches) 4. **QB differences**: Different QBs throwing to them 5. **Situation mix**: 3rd down catches vs garbage time 6. **Statistical significance**: Is 90% vs 85% significantly different? - With 50 targets each: CI overlap likely 7. **Value added**: Catch rate doesn't capture: - Yards after catch - EPA per target - Separation creation **Better approach**: Compare EPA per target, YPRR, or catch rate above expected (CAE) based on route difficulty.Scoring
| Section | Points | Your Score |
|---|---|---|
| Multiple Choice (1-12) | 12 | ___ |
| True/False (13-18) | 6 | ___ |
| Calculation (19-21) | 6 | ___ |
| Short Answer (22-24) | 6 | ___ |
| Application (25-26) | 6 | ___ |
| Matching (27) | 4 | ___ |
| True Understanding (28-30) | 6 | ___ |
| Total | 46 | ___ |
Passing Score: 32/46 (70%)