Chapter 21: Key Takeaways - Modeling Combat Sports and Tennis
-
Elo and Glicko-2 systems provide the foundational backbone for individual sports modeling. The Elo system's simplicity makes it a powerful starting point, but sport-specific calibration of the K-factor is critical. MMA requires K-factors of 100-200 due to infrequent competition, while tennis performs best at 20-32. Glicko-2 adds rating deviation and volatility parameters that naturally handle irregular competition schedules, making it particularly well-suited for combat sports.
-
A single rating is insufficient for tennis; surface-specific ratings are essential. A player's ability on clay may differ by 100+ Elo points from their ability on grass. Surface-adjusted Elo systems that blend overall and surface-specific ratings using a sample-size-dependent weight produce significantly better predictions than overall-only ratings, especially during surface transitions at the start of the clay or grass season.
-
Style matchups systematically modify baseline rating predictions in measurable ways. The boxing proverb "styles make fights" can be quantified through a matchup matrix that encodes the log-odds adjustment for each style pairing. Wrestlers consistently overperform their Elo against strikers and counter-strikers. These adjustments are small (typically 2-6 percentage points) but persistent and valuable for identifying mispriced fights.
-
Physical attributes in combat sports provide quantifiable edges beyond what ratings capture. Reach advantages of 3+ inches add approximately 1-2 percentage points per inch to win probability. Age curves follow a predictable decline after age 30 that accelerates nonlinearly. Severe weight cuts and accumulated knockout vulnerability provide additional predictive signals that the market frequently underweights.
-
Live win probability models for tennis can be computed exactly from serve-win probabilities. Tennis's hierarchical scoring structure (points, games, sets, match) allows recursive computation of exact win probabilities at any score state. The key is Bayesian updating of serve-point-win probabilities as the match progresses, incorporating in-match performance while maintaining appropriate prior influence.
-
MMA live modeling requires tracking three simultaneous components: finish probability, scorecard accumulation, and round-by-round evolution. Unlike tennis, where scoring is deterministic, MMA must account for the ever-present possibility of a fight-ending event (KO, submission, stoppage) alongside the 10-point-must scoring system. Knockdowns and submission attempts dramatically increase within-round finish probability.
-
Inactivity handling separates good individual sports models from mediocre ones. Fighters and players returning from layoffs present a unique challenge. Glicko-2's natural rating deviation growth is elegant, but even simpler Elo systems benefit from inactivity decay mechanisms that increase uncertainty rather than simply penalizing the returning competitor.
-
Indoor/outdoor and altitude effects in tennis are significant and often mispriced. Indoor hard courts play faster than outdoor, benefiting serve-dominant players. High-altitude venues reduce air drag by up to 27%, making serves harder to return and topspin less effective. These environmental factors create systematic edges during indoor seasons and at elevation tournaments.
-
Promotion-tier initialization and informed priors dramatically accelerate Elo convergence. A UFC debutant from a top regional promotion should not start at the same rating as a minor regional fighter. Promotion-based initialization and Bayesian prior information reduce the number of fights needed for ratings to stabilize, which is critical in sports where fighters may only compete 2-3 times per year.
-
The chin deterioration phenomenon in combat sports is irreversible, progressive, and modelable. Career KO/TKO losses, total significant strikes absorbed, and age combine to produce a knockout vulnerability index that predicts future finish susceptibility. This is one of the strongest and most underappreciated signals in combat sports modeling.
-
Individual sports markets are structurally less efficient than major team sports markets. The combination of heterogeneous outcome determinants, less sharp action, frequent competition (in tennis), and the complexity of style/surface/physical interactions creates persistent edges for disciplined quantitative bettors. Lower-tier events and Challenger-level tennis offer the widest market inefficiencies.
-
A complete individual sports betting system integrates all components: ratings, matchups, surface/venue, physical attributes, and live modeling. No single component is sufficient on its own. The Elo/Glicko-2 rating provides the backbone, style matchup analysis adds 2-6 percentage points of adjustment, surface effects add another 3-8 points in extreme cases, and physical attributes contribute additional edge. The live model extends the same framework into the match itself.