Case Study 2: Edge Detection Across 500 Markets

Overview

This case study demonstrates a systematic, quantitative approach to finding edge across a large universe of prediction markets. Rather than analyzing individual markets one by one, we build an automated pipeline that scans 500 markets, identifies potential edge opportunities, filters by EV threshold, sizes positions with the Kelly criterion, and simulates portfolio outcomes. This approach is suited for traders who prefer a data-driven, systematic strategy over discretionary analysis.


Part 1: The Setup

The Universe

We simulate a universe of 500 prediction markets spanning multiple categories:

Category Count Description
Politics 120 Elections, legislation, appointments
Economics 100 Economic indicators, Fed decisions, GDP
Sports 80 Game outcomes, season totals, awards
Science/Tech 60 Product launches, scientific milestones
Entertainment 50 Awards shows, box office, ratings
Weather 40 Temperature records, storms, precipitation
Geopolitics 50 International events, treaties, conflicts

The Trader's Model

Our systematic trader (call him Alex) has built a quantitative model that produces probability estimates for each market. The model combines:

  1. Base rates from historical data (weighted 30%)
  2. Feature-based predictions from a logistic regression trained on past markets (weighted 40%)
  3. Market price itself, used as a signal (weighted 20%)
  4. Recency-adjusted trends capturing momentum in related data (weighted 10%)

Critically, the model is not equally good across all categories. Alex has domain expertise in economics and politics but limited knowledge of sports and entertainment. This means the model's edge varies by category.

Simulation Parameters

Parameter Value
Starting bankroll $100,000
Kelly fraction 0.25 (quarter Kelly)
Minimum edge threshold 3 percentage points
Maximum position size 5% of bankroll
Maximum total exposure 40% of bankroll
Trading fees 1% of position value
Simulation period 6 months

Part 2: The Scanning Pipeline

Step 1: Generate Probability Estimates

For each of the 500 markets, Alex's model produces: - A point estimate $\hat{q}$ - An uncertainty measure $\sigma_q$ (standard deviation of the estimate) - A confidence interval $[\hat{q} - 1.28\sigma_q, \hat{q} + 1.28\sigma_q]$ (80% CI)

Step 2: Compare to Market Prices

For each market, compute: - Raw edge: $\hat{q} - p$ (for YES) or $p - \hat{q}$ (for NO) - Best side: YES if $\hat{q} > p$, NO otherwise - Conservative edge: using the lower/upper bound of the CI as appropriate

Step 3: Filter by Edge Threshold

Only consider markets where the conservative edge (based on the 80% CI) exceeds the minimum threshold of 3 percentage points. This is a stringent filter that eliminates marginal opportunities where the model's uncertainty undermines the apparent edge.

Step 4: Compute Kelly Fractions

For each passing market, compute: - Full Kelly fraction - Quarter Kelly fraction (our chosen multiplier) - Capped at 5% of bankroll per position

Step 5: Portfolio Construction

Allocate capital to the highest-EV opportunities, subject to: - Maximum 40% total exposure - Maximum 5% per position - Diversification across categories (no more than 15% in any single category)


Part 3: Scanning Results

Raw Scan Output

Of 500 markets scanned:

Filter Stage Markets Remaining
Total universe 500
Model estimates edge > 0 (point estimate) 287
Conservative edge > 0 (80% CI lower bound) 158
Conservative edge > 3 pp 62
After liquidity filter (min volume > 100 contracts/day) 48
After correlation filter (remove highly correlated duplicates) 41

41 markets pass all filters, representing 8.2% of the universe. This hit rate is realistic -- the vast majority of markets are either fairly priced or have edge too small to be confidently exploited.

Distribution of Edge

Among the 41 passing markets:

Edge Range (pp) Count Avg EV%
3.0 - 5.0 18 7.2%
5.0 - 8.0 12 13.5%
8.0 - 12.0 7 19.8%
12.0 - 20.0 3 28.4%
20.0+ 1 52.1%

The distribution is right-skewed: most opportunities have modest edge (3-5 pp), with a few high-edge opportunities that disproportionately drive returns.

Edge by Category

Category Markets Passing Avg Edge (pp) Notes
Politics 12 8.4 Alex's strongest category
Economics 9 7.1 Strong model performance
Sports 4 4.2 Limited model accuracy
Science/Tech 6 5.8 Moderate edge
Entertainment 2 3.5 Weakest category
Weather 4 6.9 Base rate driven
Geopolitics 4 5.5 Moderate edge

Alex's edge is concentrated in politics and economics, consistent with his domain expertise. The model is weakest in entertainment and sports, where Alex has no informational advantage.


Part 4: Portfolio Construction

Capital Allocation

The 41 qualifying markets receive the following allocations:

Top 10 positions by allocated capital:

Rank Market Category Edge (pp) Kelly (quarter) Allocated
1 Fed rate decision Q3 Economics 14.2 8.1% $5,000
2 Governor race, State Y Politics 12.8 7.2% $5,000
3 Senate confirmation vote Politics 11.1 6.4% $5,000
4 GDP growth > 2.5% Economics 9.5 5.3% $5,000
5 Tech company IPO date Science/Tech 8.7 4.9% $4,900
6 Primary election, State Z Politics 8.3 4.6% $4,600
7 January temp record Weather 7.9 4.4% $4,400
8 Unemployment below 4% Economics 7.2 4.0% $4,000
9 Bill passage by EOY Politics 6.8 3.8% $3,800
10 Championship series winner Sports 6.1 3.4% $3,400

Portfolio summary:

Metric Value
Number of positions 41
Total capital allocated $38,400
Percent of bankroll deployed 38.4%
Average position size $936
Median position size $780
Largest position $5,000 (5% cap)
Smallest position $310
Capital held in reserve $61,600

Portfolio Diversification

Category Capital Allocated % of Portfolio
Politics $14,200 37.0%
Economics $10,100 26.3%
Science/Tech $4,900 12.8%
Weather $3,800 9.9%
Geopolitics $2,800 7.3%
Sports $1,600 4.2%
Entertainment $1,000 2.6%

The portfolio is concentrated in politics and economics, reflecting Alex's comparative advantage. While this reduces diversification, it concentrates capital where edge is highest.


Part 5: Simulation Results

We simulate the portfolio's performance over 6 months using Monte Carlo methods (1,000 simulation runs). In each run, markets resolve randomly according to the "true" probabilities (which we set to be close to Alex's estimates, with some noise to reflect model imperfection).

Model Accuracy Assumptions

To make the simulation realistic, we assume Alex's model is imperfect:

  • For each market, the "true" probability is drawn from a distribution centered on Alex's estimate with standard deviation $\sigma_q$ (the model's own uncertainty estimate)
  • On average, Alex's estimates are biased slightly toward the market price (the model is not perfectly independent of market information)
  • The model is better calibrated in politics/economics and worse in sports/entertainment

Monte Carlo Results (1,000 Runs)

Metric Value
Mean final bankroll $106,840
Median final bankroll $106,200
5th percentile $97,100
95th percentile $117,800
Mean return +6.84%
Median return +6.20%
Probability of profit 87.3%
Probability of > 10% return 28.4%
Probability of loss > 5% 3.1%
Mean max drawdown 5.2%
Worst-case max drawdown (95th percentile) 11.8%
Sharpe ratio (annualized) 1.42

Interpretation

The portfolio has an 87.3% probability of being profitable over 6 months, with a mean return of 6.84%. This corresponds to an annualized return of approximately 14%, which is attractive for a strategy with limited drawdown.

The 5th percentile outcome (losing about 3%) is manageable. The worst-case drawdown of 11.8% (at 95th percentile) is well within tolerance.

The Sharpe ratio of 1.42 is strong, comparable to top-tier quantitative hedge fund strategies. This is partly because prediction markets have returns that are less correlated with traditional financial markets, which reduces portfolio-level risk.

What Drives Returns

Decomposing the mean return across categories:

Category Contribution to Return % of Total Return
Politics +$3,180 46.4%
Economics +$2,050 29.9%
Science/Tech +$620 9.1%
Weather +$450 6.6%
Geopolitics +$320 4.7%
Sports +$140 2.0%
Entertainment +$80 1.2%

Politics and economics together account for 76.3% of returns while using 63.3% of allocated capital. This confirms that Alex should concentrate on these categories.

The Impact of Filtering

To demonstrate the importance of edge filtering, we compare the full-portfolio strategy to alternatives:

Strategy Mean Return Prob of Profit Max Drawdown
Full pipeline (41 markets, edge > 3pp) +6.84% 87.3% 5.2%
All 287 markets with any positive edge +1.20% 61.2% 12.8%
All 500 markets (random) -1.40% 38.5% 18.3%
Only top 10 by edge +4.50% 82.1% 8.7%

The full pipeline dramatically outperforms trading all markets. Trading all 500 markets randomly (no model) produces negative returns due to fees. Trading only the top 10 markets by edge gives good returns but with more concentration risk.

The edge filter is critical. Without it, the many marginal trades (with edge near zero or negative) dilute the portfolio's returns and increase risk.


Part 6: Sensitivity Analysis

Sensitivity to Kelly Fraction

Kelly Fraction Mean Return Prob of Profit Mean Max Drawdown
Full Kelly (1.0) +18.5% 89.1% 22.4%
Three-quarter Kelly (0.75) +15.2% 88.8% 16.1%
Half Kelly (0.50) +11.1% 88.5% 10.2%
Quarter Kelly (0.25) +6.84% 87.3% 5.2%
Tenth Kelly (0.10) +2.8% 83.1% 2.1%

Full Kelly offers the highest returns but with a 22.4% average drawdown. Quarter Kelly provides a good balance of returns and risk management. The probability of profit is fairly stable across fractions, but drawdown risk increases significantly at higher fractions.

Sensitivity to Edge Threshold

Minimum Edge (pp) Markets Passing Mean Return Sharpe Ratio
0 (any positive) 287 +1.20% 0.35
1 198 +2.80% 0.62
2 112 +4.50% 0.95
3 62 +6.84% 1.42
5 23 +5.20% 1.68
8 11 +3.80% 1.85
12 4 +2.10% 1.75

There is an optimal trade-off: very low thresholds include too many marginal trades (diluting returns), while very high thresholds exclude too many trades (limiting the number of bets and increasing concentration risk). For this model and universe, a threshold around 3-5 pp maximizes the Sharpe ratio.

Sensitivity to Model Accuracy

We vary the model's accuracy by adjusting how much noise is added to Alex's estimates:

Model Quality Mean Estimate Error Mean Return Prob of Profit
Excellent (low noise) 3.0 pp +10.2% 92.1%
Good (baseline) 5.0 pp +6.84% 87.3%
Fair (moderate noise) 8.0 pp +3.1% 72.5%
Poor (high noise) 12.0 pp -0.5% 48.2%
Random (no skill) 20.0 pp -2.8% 32.1%

Model quality is the primary driver of profitability. A mediocre model with high noise produces negative returns even with Kelly sizing and edge filtering. This underscores the importance of investing in model development and honest assessment of model accuracy.


Part 7: Practical Considerations

Execution Challenges

The simulation assumes perfect execution (trading at the observed market price). In reality:

  1. Slippage: Large orders move the price, especially in thin markets. Alex should break large orders into smaller pieces.
  2. Latency: By the time Alex acts on his model's output, the market may have moved. Real-time integration with market APIs reduces this risk.
  3. Market availability: Not all 500 markets may be available on a single platform. Cross-platform trading introduces additional complexity.

Correlation Risk

The simulation treats all markets as independent, but in reality:

  • Political markets in the same election cycle are correlated
  • Economic markets respond to common macro factors
  • Multiple sports markets may depend on the same underlying team or player performance

Ignoring correlations leads to overexposure. Alex should group correlated markets and apply position limits at the group level, not just the individual market level.

Rebalancing

As markets resolve and new markets appear, the portfolio must be rebalanced: - Resolved markets free up capital for new opportunities - New markets should be scanned and added if they pass the edge filter - Existing positions should be reviewed if prices move significantly (edge may shrink or grow)

A weekly rebalancing cycle is practical for most prediction market traders. More active traders might rebalance daily.

Scaling Challenges

As bankroll grows, some markets become too small (insufficient liquidity) to absorb larger positions. Alex may need to: - Expand to additional platforms - Trade higher-liquidity markets (which tend to have smaller edges) - Accept lower returns per dollar as the strategy scales


Part 8: Key Takeaways

  1. Systematic scanning dramatically outperforms discretionary trading. By evaluating 500 markets rather than relying on a handful of gut-feel trades, Alex identifies more opportunities and avoids selection bias.

  2. Edge filtering is essential. Most markets do not offer meaningful edge. Trading everything dilutes returns. A 3-5 percentage point minimum edge threshold balances opportunity capture with quality.

  3. Concentrate on your comparative advantage. Alex's edge is concentrated in politics and economics. Rather than trying to trade every category, he should focus where his model performs best.

  4. Position sizing protects against model failure. Quarter Kelly with position caps limits drawdowns even when the model is wrong. This is especially important for a systematic strategy where errors can be correlated.

  5. Model quality is the bottleneck. All the fancy portfolio construction in the world cannot compensate for poor probability estimates. Invest in model accuracy above all else.

  6. The pipeline is more valuable than any single trade. The systematic process -- scanning, filtering, sizing, tracking -- generates consistent returns. No single trade matters much. What matters is the process repeated over hundreds of trades.


Code

The complete simulation code for this case study is available in code/case-study-code.py. It includes: - Market universe generation - Model simulation with configurable accuracy - Edge scanning and filtering pipeline - Kelly-based portfolio construction - Monte Carlo portfolio simulation - Sensitivity analysis across key parameters - Visualization of results