Appendix B: Statistical Tables and Soccer Benchmarks
This appendix collects reference tables for the probability distributions, critical values, and soccer-specific benchmarks used throughout the text. While modern software eliminates the need to look up values by hand, these tables provide useful reference points for quick estimation and sanity-checking results.
B.1 Probability Distributions
B.1.1 Standard Normal Distribution
The standard normal distribution $Z \sim \mathcal{N}(0, 1)$ has PDF:
$$\phi(z) = \frac{1}{\sqrt{2\pi}} e^{-z^2/2}$$
and CDF $\Phi(z) = P(Z \leq z)$.
Upper-Tail Probabilities: $P(Z > z)$
| $z$ | 0.00 | 0.01 | 0.02 | 0.03 | 0.04 | 0.05 | 0.06 | 0.07 | 0.08 | 0.09 |
|---|---|---|---|---|---|---|---|---|---|---|
| 0.0 | .5000 | .4960 | .4920 | .4880 | .4840 | .4801 | .4761 | .4721 | .4681 | .4641 |
| 0.1 | .4602 | .4562 | .4522 | .4483 | .4443 | .4404 | .4364 | .4325 | .4286 | .4247 |
| 0.2 | .4207 | .4168 | .4129 | .4090 | .4052 | .4013 | .3974 | .3936 | .3897 | .3859 |
| 0.3 | .3821 | .3783 | .3745 | .3707 | .3669 | .3632 | .3594 | .3557 | .3520 | .3483 |
| 0.4 | .3446 | .3409 | .3372 | .3336 | .3300 | .3264 | .3228 | .3192 | .3156 | .3121 |
| 0.5 | .3085 | .3050 | .3015 | .2981 | .2946 | .2912 | .2877 | .2843 | .2810 | .2776 |
| 0.6 | .2743 | .2709 | .2676 | .2643 | .2611 | .2578 | .2546 | .2514 | .2483 | .2451 |
| 0.7 | .2420 | .2389 | .2358 | .2327 | .2296 | .2266 | .2236 | .2206 | .2177 | .2148 |
| 0.8 | .2119 | .2090 | .2061 | .2033 | .2005 | .1977 | .1949 | .1922 | .1894 | .1867 |
| 0.9 | .1841 | .1814 | .1788 | .1762 | .1736 | .1711 | .1685 | .1660 | .1635 | .1611 |
| 1.0 | .1587 | .1562 | .1539 | .1515 | .1492 | .1469 | .1446 | .1423 | .1401 | .1379 |
| 1.1 | .1357 | .1335 | .1314 | .1292 | .1271 | .1251 | .1230 | .1210 | .1190 | .1170 |
| 1.2 | .1151 | .1131 | .1112 | .1093 | .1075 | .1056 | .1038 | .1020 | .1003 | .0985 |
| 1.3 | .0968 | .0951 | .0934 | .0918 | .0901 | .0885 | .0869 | .0853 | .0838 | .0823 |
| 1.4 | .0808 | .0793 | .0778 | .0764 | .0749 | .0735 | .0721 | .0708 | .0694 | .0681 |
| 1.5 | .0668 | .0655 | .0643 | .0630 | .0618 | .0606 | .0594 | .0582 | .0571 | .0559 |
| 1.6 | .0548 | .0537 | .0526 | .0516 | .0505 | .0495 | .0485 | .0475 | .0465 | .0455 |
| 1.7 | .0446 | .0436 | .0427 | .0418 | .0409 | .0401 | .0392 | .0384 | .0375 | .0367 |
| 1.8 | .0359 | .0351 | .0344 | .0336 | .0329 | .0322 | .0314 | .0307 | .0301 | .0294 |
| 1.9 | .0287 | .0281 | .0274 | .0268 | .0262 | .0256 | .0250 | .0244 | .0239 | .0233 |
| 2.0 | .0228 | .0222 | .0217 | .0212 | .0207 | .0202 | .0197 | .0192 | .0188 | .0183 |
| 2.1 | .0179 | .0174 | .0170 | .0166 | .0162 | .0158 | .0154 | .0150 | .0146 | .0143 |
| 2.2 | .0139 | .0136 | .0132 | .0129 | .0125 | .0122 | .0119 | .0116 | .0113 | .0110 |
| 2.3 | .0107 | .0104 | .0102 | .0099 | .0096 | .0094 | .0091 | .0089 | .0087 | .0084 |
| 2.4 | .0082 | .0080 | .0078 | .0075 | .0073 | .0071 | .0069 | .0068 | .0066 | .0064 |
| 2.5 | .0062 | .0060 | .0059 | .0057 | .0055 | .0054 | .0052 | .0051 | .0049 | .0048 |
| 2.6 | .0047 | .0045 | .0044 | .0043 | .0041 | .0040 | .0039 | .0038 | .0037 | .0036 |
| 2.7 | .0035 | .0034 | .0033 | .0032 | .0031 | .0030 | .0029 | .0028 | .0027 | .0026 |
| 2.8 | .0026 | .0025 | .0024 | .0023 | .0023 | .0022 | .0021 | .0021 | .0020 | .0019 |
| 2.9 | .0019 | .0018 | .0018 | .0017 | .0016 | .0016 | .0015 | .0015 | .0014 | .0014 |
| 3.0 | .0013 | .0013 | .0013 | .0012 | .0012 | .0011 | .0011 | .0011 | .0010 | .0010 |
Commonly Used Critical Values:
| Confidence Level | $\alpha$ | $z_{\alpha/2}$ |
|---|---|---|
| 90% | 0.10 | 1.645 |
| 95% | 0.05 | 1.960 |
| 99% | 0.01 | 2.576 |
| 99.9% | 0.001 | 3.291 |
B.1.2 Poisson Distribution
The Poisson distribution $X \sim \text{Pois}(\lambda)$ gives $P(X = k) = \frac{\lambda^k e^{-\lambda}}{k!}$.
Poisson Probabilities for Selected Values of $\lambda$
| $k$ | $\lambda = 0.5$ | $\lambda = 1.0$ | $\lambda = 1.2$ | $\lambda = 1.5$ | $\lambda = 2.0$ | $\lambda = 2.5$ | $\lambda = 3.0$ |
|---|---|---|---|---|---|---|---|
| 0 | .6065 | .3679 | .3012 | .2231 | .1353 | .0821 | .0498 |
| 1 | .3033 | .3679 | .3614 | .3347 | .2707 | .2052 | .1494 |
| 2 | .0758 | .1839 | .2169 | .2510 | .2707 | .2565 | .2240 |
| 3 | .0126 | .0613 | .0867 | .1255 | .1804 | .2138 | .2240 |
| 4 | .0016 | .0153 | .0260 | .0471 | .0902 | .1336 | .1680 |
| 5 | .0002 | .0031 | .0062 | .0141 | .0361 | .0668 | .1008 |
| 6 | .0000 | .0005 | .0012 | .0035 | .0120 | .0278 | .0504 |
| 7 | .0000 | .0001 | .0002 | .0008 | .0034 | .0099 | .0216 |
Note: The values $\lambda = 1.2$ and $\lambda = 1.5$ are particularly relevant for soccer, as they approximate typical home and away goal-scoring rates in major European leagues (see Section B.3).
Cumulative Poisson Probabilities $P(X \leq k)$ for $\lambda = 1.5$ (typical home goals)
| $k$ | $P(X = k)$ | $P(X \leq k)$ |
|---|---|---|
| 0 | .2231 | .2231 |
| 1 | .3347 | .5578 |
| 2 | .2510 | .8088 |
| 3 | .1255 | .9344 |
| 4 | .0471 | .9814 |
| 5 | .0141 | .9955 |
| 6+ | .0045 | 1.0000 |
B.1.3 Binomial Distribution
The binomial distribution $X \sim \text{Bin}(n, p)$ gives $P(X = k) = \binom{n}{k} p^k (1-p)^{n-k}$.
Binomial Probabilities for $n = 10$ (e.g., 10 shots on target)
| $k$ | $p = 0.10$ | $p = 0.20$ | $p = 0.30$ | $p = 0.40$ | $p = 0.50$ |
|---|---|---|---|---|---|
| 0 | .3487 | .1074 | .0282 | .0060 | .0010 |
| 1 | .3874 | .2684 | .1211 | .0403 | .0098 |
| 2 | .1937 | .3020 | .2335 | .1209 | .0439 |
| 3 | .0574 | .2013 | .2668 | .2150 | .1172 |
| 4 | .0112 | .0881 | .2001 | .2508 | .2051 |
| 5 | .0015 | .0264 | .1029 | .2007 | .2461 |
| 6 | .0001 | .0055 | .0368 | .1115 | .2051 |
| 7 | .0000 | .0008 | .0090 | .0425 | .1172 |
| 8 | .0000 | .0001 | .0014 | .0106 | .0439 |
| 9 | .0000 | .0000 | .0001 | .0016 | .0098 |
| 10 | .0000 | .0000 | .0000 | .0001 | .0010 |
Note: A shot-on-target conversion rate of $p = 0.30$ is a useful benchmark for top strikers in major leagues.
B.1.4 Student's $t$-Distribution
The $t$-distribution with $\nu$ degrees of freedom is used for inference about means with unknown population variance. As $\nu \to \infty$, $t_\nu \to \mathcal{N}(0,1)$.
Critical Values $t_{\alpha, \nu}$ (Upper-Tail)
| $\nu$ | $\alpha = 0.10$ | $\alpha = 0.05$ | $\alpha = 0.025$ | $\alpha = 0.01$ | $\alpha = 0.005$ |
|---|---|---|---|---|---|
| 1 | 3.078 | 8.314 | 14.706 | 31.821 | 63.657 |
| 2 | 1.886 | 2.920 | 4.303 | 8.965 | 11.925 |
| 3 | 1.638 | 2.353 | 3.182 | 4.541 | 7.841 |
| 4 | 1.533 | 2.132 | 2.776 | 3.747 | 4.604 |
| 5 | 1.476 | 2.015 | 2.571 | 3.365 | 4.032 |
| 6 | 1.440 | 1.943 | 2.447 | 3.143 | 3.707 |
| 7 | 1.415 | 1.895 | 2.365 | 2.998 | 3.499 |
| 8 | 1.397 | 1.860 | 2.306 | 2.896 | 3.355 |
| 9 | 1.383 | 1.833 | 2.262 | 2.821 | 3.250 |
| 10 | 1.372 | 1.812 | 2.228 | 2.764 | 3.169 |
| 15 | 1.341 | 1.753 | 2.131 | 2.602 | 2.947 |
| 20 | 1.325 | 1.725 | 2.086 | 2.528 | 2.845 |
| 25 | 1.316 | 1.708 | 2.060 | 2.485 | 2.787 |
| 30 | 1.310 | 1.697 | 2.042 | 2.457 | 2.750 |
| 40 | 1.303 | 1.684 | 2.021 | 2.423 | 2.704 |
| 60 | 1.296 | 1.671 | 2.000 | 2.390 | 2.660 |
| 120 | 1.289 | 1.658 | 1.980 | 2.358 | 2.617 |
| $\infty$ | 1.282 | 1.645 | 1.960 | 2.326 | 2.576 |
For a two-tailed test at significance level $\alpha$, use the $\alpha/2$ column.
B.1.5 Chi-Squared Distribution
The chi-squared distribution $\chi^2_\nu$ with $\nu$ degrees of freedom. Used for goodness-of-fit tests, tests of independence, and comparing nested models.
Critical Values $\chi^2_{\alpha, \nu}$ (Upper-Tail)
| $\nu$ | $\alpha = 0.10$ | $\alpha = 0.05$ | $\alpha = 0.025$ | $\alpha = 0.01$ | $\alpha = 0.005$ |
|---|---|---|---|---|---|
| 1 | 2.706 | 3.841 | 7.024 | 8.635 | 9.879 |
| 2 | 4.605 | 7.991 | 9.378 | 11.210 | 12.597 |
| 3 | 8.251 | 9.815 | 11.348 | 13.345 | 14.838 |
| 4 | 9.779 | 11.488 | 13.143 | 15.277 | 16.860 |
| 5 | 11.236 | 13.070 | 14.833 | 17.086 | 18.750 |
| 6 | 12.645 | 14.592 | 16.449 | 18.812 | 20.548 |
| 7 | 14.017 | 16.067 | 18.013 | 20.475 | 22.278 |
| 8 | 15.362 | 17.507 | 19.535 | 22.090 | 23.955 |
| 9 | 16.684 | 18.919 | 21.023 | 23.666 | 25.589 |
| 10 | 17.987 | 20.307 | 22.483 | 25.209 | 27.188 |
| 15 | 24.307 | 26.996 | 29.488 | 30.578 | 32.801 |
| 20 | 30.412 | 31.410 | 34.170 | 37.566 | 39.997 |
| 25 | 34.382 | 37.652 | 40.646 | 44.314 | 46.928 |
| 30 | 40.256 | 43.773 | 46.979 | 50.892 | 53.672 |
B.1.6 F-Distribution Selected Critical Values
The $F$-distribution $F_{\nu_1, \nu_2}$ is used for ANOVA and comparing regression models.
$F_{0.05, \nu_1, \nu_2}$ Critical Values
| $\nu_2 \backslash \nu_1$ | 1 | 2 | 3 | 4 | 5 | 10 | 20 |
|---|---|---|---|---|---|---|---|
| 5 | 8.608 | 7.786 | 7.409 | 7.192 | 7.050 | 4.735 | 4.558 |
| 10 | 4.965 | 4.103 | 3.708 | 3.478 | 3.326 | 2.978 | 2.774 |
| 15 | 4.543 | 3.682 | 3.287 | 3.056 | 2.901 | 2.544 | 2.328 |
| 20 | 4.351 | 3.493 | 3.098 | 2.866 | 2.711 | 2.348 | 2.124 |
| 30 | 4.171 | 3.316 | 2.922 | 2.690 | 2.534 | 2.165 | 1.932 |
| 60 | 4.001 | 3.150 | 2.758 | 2.525 | 2.368 | 1.993 | 1.748 |
| 120 | 3.920 | 3.072 | 2.680 | 2.447 | 2.290 | 1.910 | 1.659 |
B.2 Critical Values Quick Reference
B.2.1 Common Significance Thresholds
| Test Type | Significance Level | Critical Value | Distribution |
|---|---|---|---|
| Two-tailed $z$-test | $\alpha = 0.05$ | $\pm 1.960$ | Standard Normal |
| Two-tailed $z$-test | $\alpha = 0.01$ | $\pm 2.576$ | Standard Normal |
| One-tailed $z$-test | $\alpha = 0.05$ | $1.645$ | Standard Normal |
| Two-tailed $t$-test ($\nu = 30$) | $\alpha = 0.05$ | $\pm 2.042$ | $t_{30}$ |
| Chi-squared ($\nu = 1$) | $\alpha = 0.05$ | $3.841$ | $\chi^2_1$ |
| Chi-squared ($\nu = 5$) | $\alpha = 0.05$ | $13.070$ | $\chi^2_5$ |
B.2.2 Sample Size Guidelines for Soccer Analytics
These rules of thumb help determine when certain approximations are reasonable:
| Scenario | Minimum $n$ | Notes |
|---|---|---|
| Central Limit Theorem approximation | 30 | For approximately symmetric distributions |
| Normal approximation to Poisson | $\lambda \geq 10$ | Rare in single-match contexts; applicable to season totals |
| Normal approximation to Binomial | $np \geq 5$ and $n(1-p) \geq 5$ | E.g., 20 shots at 30% conversion: $np = 6$ |
| Stable xG estimates for a player | ~50 shots | Fewer for high-volume shooters |
| Stable pass completion rates | ~200 passes | Position-dependent |
| Reliable season-level team metrics | ~15 matches | Half a typical league season |
| Robust player ratings | ~900 minutes | Approximately 10 full matches |
B.3 Soccer Benchmarks
B.3.1 Goals Per Game by League
Average goals per match across Europe's top five leagues (approximate values; season-to-season variation of roughly $\pm 0.15$).
| League | Goals/Game | Home Goals/Game | Away Goals/Game | Home Win % | Draw % | Away Win % |
|---|---|---|---|---|---|---|
| Premier League (England) | 2.70 | 1.52 | 1.18 | 43% | 24% | 33% |
| La Liga (Spain) | 2.55 | 1.45 | 1.10 | 45% | 24% | 31% |
| Bundesliga (Germany) | 2.90 | 1.60 | 1.30 | 43% | 22% | 35% |
| Serie A (Italy) | 2.65 | 1.48 | 1.17 | 44% | 25% | 31% |
| Ligue 1 (France) | 2.55 | 1.42 | 1.13 | 44% | 25% | 31% |
| Eredivisie (Netherlands) | 3.10 | 1.72 | 1.38 | 44% | 21% | 35% |
| MLS (USA/Canada) | 2.80 | 1.55 | 1.25 | 45% | 24% | 31% |
| Champions League (group stage) | 2.85 | 1.60 | 1.25 | 46% | 22% | 32% |
B.3.2 Expected Goals (xG) Benchmarks
| Metric | Typical Range | Elite Benchmark | Notes |
|---|---|---|---|
| Team xG per match (top league) | 1.0 -- 1.8 | > 2.0 | Manchester City 2017--2023 averaged ~2.2 |
| Team xGA per match (top league) | 0.8 -- 1.5 | < 0.8 | Best defenses consistently under 1.0 |
| xG per shot (average) | 0.08 -- 0.12 | -- | Varies by league and era |
| xG per shot (penalty) | 0.76 -- 0.79 | -- | Approximately 76--79% conversion rate |
| xG per shot (open play, inside box) | 0.10 -- 0.20 | -- | Depends on angle and distance |
| xG per shot (open play, outside box) | 0.02 -- 0.05 | -- | Long-range shots are low-probability |
| xG per shot (header) | 0.04 -- 0.08 | -- | Headers convert at lower rates |
| Player Goals minus xG (season) | -5 to +5 | > +5 | Sustained overperformance is rare |
B.3.3 Passing Benchmarks
| Metric | Average | Top Quartile | Elite (Top 5%) |
|---|---|---|---|
| Overall pass completion % | 80--85% | 87--90% | > 90% |
| Short pass completion % | 88--92% | 93--95% | > 95% |
| Long pass completion % | 55--65% | 68--72% | > 75% |
| Progressive passes per 90 | 4--7 | 8--10 | > 12 |
| Key passes per 90 | 0.8--1.5 | 1.8--2.5 | > 3.0 |
| Passes into final third per 90 | 4--8 | 9--12 | > 14 |
| Pass completion % (under pressure) | 70--78% | 80--84% | > 85% |
Note: Passing statistics are heavily position-dependent. Center-backs and defensive midfielders typically have higher completion rates but fewer progressive passes than attacking players.
B.3.4 Defensive Benchmarks
| Metric | Average | Top Quartile | Elite (Top 5%) |
|---|---|---|---|
| Tackles per 90 | 1.5--2.5 | 2.8--3.5 | > 4.0 |
| Interceptions per 90 | 1.0--1.8 | 2.0--2.5 | > 3.0 |
| Aerial duels won % | 45--55% | 58--65% | > 68% |
| Pressures per 90 | 15--22 | 23--28 | > 30 |
| Pressure success rate | 25--32% | 33--38% | > 40% |
| Clearances per 90 | 2.5--4.5 | 7.0--8.5 | > 9.0 |
| Blocks per 90 | 1.0--1.8 | 2.0--2.5 | > 3.0 |
B.3.5 Possession and Territorial Benchmarks
| Metric | Bottom Quartile | Median | Top Quartile | Elite |
|---|---|---|---|---|
| Possession % | < 45% | 50% | > 55% | > 65% |
| Touches in opp. box per 90 | < 18 | 22--26 | 28--34 | > 38 |
| PPDA (passes per defensive action) | < 8 (high press) | 10--12 | 14--16 | > 18 (low press) |
| Field tilt (% of touches in opp. third) | < 35% | 40--45% | 48--52% | > 55% |
| Build-up attacks per 90 | < 25 | 30--38 | 40--48 | > 52 |
B.3.6 Physical Performance Benchmarks (per 90 minutes)
| Metric | Average | High | Elite |
|---|---|---|---|
| Total distance (km) | 12.0--12.8 | 13.0--13.5 | > 14.0 |
| High-speed running distance (km) (> 21.8 km/h) | 0.8--1.2 | 1.3--1.6 | > 1.8 |
| Sprint distance (km) (> 27.2 km/h) | 0.25--0.40 | 0.42--0.55 | > 0.60 |
| Number of sprints | 30--45 | 48--60 | > 65 |
| Top speed (km/h) | 28--31 | 32--34 | > 35 |
| Accelerations (> 3 m/s^2) | 40--55 | 58--70 | > 75 |
| Decelerations (> 3 m/s^2) | 38--52 | 55--68 | > 72 |
Physical benchmarks vary considerably by position. Full-backs and wide midfielders typically cover the most high-speed distance, while center-backs cover the least.
B.3.7 Goalkeeper Benchmarks
| Metric | Average | Good | Elite |
|---|---|---|---|
| Save percentage (all shots) | 68--72% | 73--76% | > 78% |
| Post-shot xG minus Goals Allowed per 90 | -0.05 to 0.05 | 0.05--0.15 | > 0.15 |
| Distribution accuracy (short) | 85--90% | 91--94% | > 95% |
| Distribution accuracy (long) | 40--50% | 52--58% | > 60% |
| Cross claim % | 5--8% | 9--12% | > 14% |
| Sweeper actions (OPA per 90) | 0.5--1.0 | 1.2--1.8 | > 2.0 |
B.3.8 Common Scoreline Probabilities
Based on a Poisson model with $\lambda_{\text{home}} = 1.5$ and $\lambda_{\text{away}} = 1.15$ (assuming independence):
| Scoreline | Probability | Cumulative (most likely) |
|---|---|---|
| 1-0 | 14.0% | 14.0% |
| 1-1 | 13.6% | 25.6% |
| 0-0 | 10.1% | 31.7% |
| 2-1 | 10.7% | 40.4% |
| 0-1 | 11.3% | 49.7% |
| 2-0 | 11.0% | 58.7% |
| 2-2 | 4.1% | 62.8% |
| 0-2 | 7.4% | 68.2% |
| 3-1 | 4.3% | 72.5% |
| 3-0 | 4.5% | 77.0% |
| 1-2 | 8.7% | 83.7% |
| 3-2 | 2.5% | 86.2% |
| All others | 15.8% | 100.0% |
Note: The independence assumption (home goals independent of away goals) is a simplification. In practice, game state effects, tactical adjustments, and other factors introduce dependencies. Chapter 7 discusses more sophisticated approaches.
For interactive computation of these distributions, see the Python functions in Appendix C. For the mathematical foundations underlying these distributions, see Appendix A, Section A.2.