Chapter 15: Key Takeaways

Player Performance Metrics


Core Definition

Player Performance Metrics: A structured, multi-dimensional framework for evaluating individual soccer players using position-specific statistical measures, normalized for playing time, contextualized against peer groups, and combined into composite profiles that support recruitment, development, and tactical decisions.


The Seven Key Points

1. Single Metrics Are Dangerous

Goals and assists capture only a fraction of a player's contribution. Comprehensive evaluation requires metrics spanning four dimensions: scoring, creation, possession, and defense. Within each dimension, both volume (how much?) and efficiency (how well?) matter.

2. Position Determines the Metric Set

The metrics that define excellence vary radically by position. Applying the same yardstick to a goalkeeper and a striker produces meaningless comparisons.

3. Per-90 Normalization Is Essential but Imperfect

Dividing raw counts by minutes played and multiplying by 90 enables fair comparison across different playing times, but small-sample sizes require either minimum-minute thresholds or Bayesian shrinkage to avoid misleading rates.

4. Players Follow Predictable Age Curves

Performance follows an inverted-U shape with age, but the peak age differs by position and by the type of metric measured. Age curves are central to transfer valuation and recruitment strategy.

5. Form Is Measurable

Rolling averages and exponentially weighted moving averages quantify recent performance relative to baseline. The Form Index (recent / baseline) detects hot and cold spells, while the coefficient of variation captures consistency.

6. Radar Charts Reveal the Whole Player

Standardizing metrics to z-scores or percentiles and plotting them on radar charts produces intuitive multi-dimensional profiles that bridge analytics departments and coaching staff.

7. Similarity Algorithms Power Recruitment

Cosine similarity, Euclidean distance, and clustering algorithms answer the fundamental scouting question: "Who else plays like this player?"


Position-Specific Metric Tables

Goalkeepers

Metric What It Measures Why It Matters
PSxG - GA Goals saved above expected Best single measure of shot-stopping
Save percentage Basic stopping rate Simple but ignores shot difficulty
Cross claim rate Command of the box Measures aerial authority
Distribution accuracy Passing from the back Critical in possession-based systems
Clean sheet % Matches without conceding Combines team and individual defense

Defenders

Metric Centre-Backs Full-Backs
Aerial duel win rate Primary Secondary
Tackles + interceptions per 90 Primary Primary
Progressive passes per 90 Important Important
Carries into final third per 90 Secondary Primary
Crosses and xA per 90 Rare Primary (modern)
Errors leading to shots Critical (negative) Critical (negative)

Midfielders

Metric Holding Box-to-Box Attacking
Tackles + interceptions per 90 Primary Important Secondary
Pass completion % Primary Important Secondary
Progressive passes per 90 Important Primary Important
Progressive carries per 90 Secondary Primary Important
xG + xA per 90 Rare Important Primary
Key passes per 90 Secondary Important Primary
Pressure success rate Important Important Secondary

Forwards

Metric Striker Winger
Non-penalty goals per 90 Primary Important
npxG per 90 Primary Important
xG outperformance (npG - npxG) Important Secondary
Shots per 90 Primary Secondary
xA per 90 Secondary Primary
Successful dribbles per 90 Secondary Primary
Touches in penalty area per 90 Primary Important
Pressing actions per 90 System-dependent System-dependent

Essential Formulas

  1. Per-90 normalization: $$\text{Metric}_{p90} = \frac{\text{Raw Count}}{\text{Minutes Played}} \times 90$$

  2. Z-score standardization: $$z_i = \frac{x_i - \mu}{\sigma}$$

  3. Bayesian shrinkage: $$\hat{\mu}_{\text{player}} = \frac{n}{n + \kappa} \cdot \bar{x}_{\text{player}} + \frac{\kappa}{n + \kappa} \cdot \bar{x}_{\text{population}}$$

  4. Aging curve (quadratic): $$\text{Performance}(a) = \beta_0 + \beta_1 a + \beta_2 a^2, \quad a^* = -\frac{\beta_1}{2\beta_2}$$

  5. Exponentially weighted moving average: $$\text{EWMA}_t = \alpha \cdot x_t + (1 - \alpha) \cdot \text{EWMA}_{t-1}$$

  6. Form Index: $$\text{Form Index} = \frac{\bar{x}_{\text{recent}}}{\bar{x}_{\text{baseline}}}$$

  7. Cosine similarity: $$\text{cos}(\mathbf{a}, \mathbf{b}) = \frac{\mathbf{a} \cdot \mathbf{b}}{\|\mathbf{a}\| \|\mathbf{b}\|}$$

  8. Coefficient of variation: $$\text{CV} = \frac{\sigma}{\bar{x}}$$


Key Thresholds and Benchmarks

Parameter Recommended Value Rationale
Minimum minutes (full season) 900 ~10 full matches; stabilizes per-90 rates
Minimum minutes (half season) 450 ~5 full matches
Bayesian shrinkage kappa 900 Minutes at which 50% weight goes to player data
Radar chart axes 6-10 Fewer is unreadable; more is cluttered
Z-score outlier threshold z
Cosine similarity "strong match" > 0.90 Indicates highly similar playing profiles

Common Pitfalls to Avoid

Pitfall Better Approach
Comparing across positions without filtering Always define a position-specific peer group first
Using raw counts to compare players with different minutes Use per-90 rates with minimum-minute thresholds
Treating per-90 rates from 200 minutes as reliable Apply Bayesian shrinkage or exclude sub-threshold players
Including penalties in striker evaluation Use non-penalty metrics (npG, npxG) alongside totals
Ignoring survivorship bias in age curves Use within-player delta methods and acknowledge selection effects
Assuming all xG outperformance is skill Require multi-season persistence before labeling it skill
Confusing cosine similarity (shape) with Euclidean distance (magnitude) Choose the metric that matches the question: "same style?" vs. "same level?"
Building radar charts with inconsistent axis direction Orient all axes so that "farther from center" means "better"

Quick Reference: Aging Curve Peaks

Position Physical Peak Technical Peak Overall Peak
Goalkeeper 27-29 28-32 28-31
Centre-back 25-27 27-30 27-30
Full-back 24-26 26-28 25-28
Central midfielder 25-27 27-30 26-29
Winger 24-26 26-28 25-28
Striker 25-27 26-29 26-29

Self-Check Questions

Before moving to Chapter 16, you should be able to answer:

  1. Why is a multi-dimensional framework necessary for player evaluation, and what are the four main dimensions?
  2. What is per-90 normalization, and what problems does it solve? What problems remain?
  3. How does Bayesian shrinkage differ from a hard minimum-minutes threshold, and when would you prefer each?
  4. What is PSxG - GA, and why is it a better goalkeeper metric than raw save percentage?
  5. Describe the general shape of an age-performance curve and explain how peak age varies by position.
  6. What is survivorship bias, and how does it distort aging curve analysis?
  7. How do you construct a z-score-based radar chart, and what design principles make radar charts effective?
  8. Explain the difference between cosine similarity and Euclidean distance for player comparison. When would each be more appropriate?
  9. What is the coefficient of variation, and how does it measure player consistency?
  10. Describe how K-Means clustering can identify player archetypes within a position group.

Connections to Other Chapters

  • Chapter 3 (Statistics): Z-scores, percentiles, and regression underpin all metrics in this chapter.
  • Chapter 7 (xG): Non-penalty xG per 90 is a core forward evaluation metric.
  • Chapter 8 (xA): Expected assists drive creation metrics for midfielders and wingers.
  • Chapter 9 (Passing Networks): Progressive passes and pass completion derive from passing analysis.
  • Chapter 12 (Defensive Metrics): Tackles, interceptions, and pressures are inputs to defender profiles.
  • Chapter 16 (Team Analysis): Player-level ratings feed into squad depth and team chemistry models.

Looking Ahead

Chapter 16: Team Performance Analysis will shift the lens from individuals to collectives, building frameworks for team style fingerprinting, expected points models, squad depth analysis, team chemistry quantification, and Monte Carlo season simulations.


Keep this summary card handy as a reference while working through later chapters.