Case Study 2: Edge Detection Across 500 Markets
Overview
This case study demonstrates a systematic, quantitative approach to finding edge across a large universe of prediction markets. Rather than analyzing individual markets one by one, we build an automated pipeline that scans 500 markets, identifies potential edge opportunities, filters by EV threshold, sizes positions with the Kelly criterion, and simulates portfolio outcomes. This approach is suited for traders who prefer a data-driven, systematic strategy over discretionary analysis.
Part 1: The Setup
The Universe
We simulate a universe of 500 prediction markets spanning multiple categories:
| Category | Count | Description |
|---|---|---|
| Politics | 120 | Elections, legislation, appointments |
| Economics | 100 | Economic indicators, Fed decisions, GDP |
| Sports | 80 | Game outcomes, season totals, awards |
| Science/Tech | 60 | Product launches, scientific milestones |
| Entertainment | 50 | Awards shows, box office, ratings |
| Weather | 40 | Temperature records, storms, precipitation |
| Geopolitics | 50 | International events, treaties, conflicts |
The Trader's Model
Our systematic trader (call him Alex) has built a quantitative model that produces probability estimates for each market. The model combines:
- Base rates from historical data (weighted 30%)
- Feature-based predictions from a logistic regression trained on past markets (weighted 40%)
- Market price itself, used as a signal (weighted 20%)
- Recency-adjusted trends capturing momentum in related data (weighted 10%)
Critically, the model is not equally good across all categories. Alex has domain expertise in economics and politics but limited knowledge of sports and entertainment. This means the model's edge varies by category.
Simulation Parameters
| Parameter | Value |
|---|---|
| Starting bankroll | $100,000 |
| Kelly fraction | 0.25 (quarter Kelly) |
| Minimum edge threshold | 3 percentage points |
| Maximum position size | 5% of bankroll |
| Maximum total exposure | 40% of bankroll |
| Trading fees | 1% of position value |
| Simulation period | 6 months |
Part 2: The Scanning Pipeline
Step 1: Generate Probability Estimates
For each of the 500 markets, Alex's model produces: - A point estimate $\hat{q}$ - An uncertainty measure $\sigma_q$ (standard deviation of the estimate) - A confidence interval $[\hat{q} - 1.28\sigma_q, \hat{q} + 1.28\sigma_q]$ (80% CI)
Step 2: Compare to Market Prices
For each market, compute: - Raw edge: $\hat{q} - p$ (for YES) or $p - \hat{q}$ (for NO) - Best side: YES if $\hat{q} > p$, NO otherwise - Conservative edge: using the lower/upper bound of the CI as appropriate
Step 3: Filter by Edge Threshold
Only consider markets where the conservative edge (based on the 80% CI) exceeds the minimum threshold of 3 percentage points. This is a stringent filter that eliminates marginal opportunities where the model's uncertainty undermines the apparent edge.
Step 4: Compute Kelly Fractions
For each passing market, compute: - Full Kelly fraction - Quarter Kelly fraction (our chosen multiplier) - Capped at 5% of bankroll per position
Step 5: Portfolio Construction
Allocate capital to the highest-EV opportunities, subject to: - Maximum 40% total exposure - Maximum 5% per position - Diversification across categories (no more than 15% in any single category)
Part 3: Scanning Results
Raw Scan Output
Of 500 markets scanned:
| Filter Stage | Markets Remaining |
|---|---|
| Total universe | 500 |
| Model estimates edge > 0 (point estimate) | 287 |
| Conservative edge > 0 (80% CI lower bound) | 158 |
| Conservative edge > 3 pp | 62 |
| After liquidity filter (min volume > 100 contracts/day) | 48 |
| After correlation filter (remove highly correlated duplicates) | 41 |
41 markets pass all filters, representing 8.2% of the universe. This hit rate is realistic -- the vast majority of markets are either fairly priced or have edge too small to be confidently exploited.
Distribution of Edge
Among the 41 passing markets:
| Edge Range (pp) | Count | Avg EV% |
|---|---|---|
| 3.0 - 5.0 | 18 | 7.2% |
| 5.0 - 8.0 | 12 | 13.5% |
| 8.0 - 12.0 | 7 | 19.8% |
| 12.0 - 20.0 | 3 | 28.4% |
| 20.0+ | 1 | 52.1% |
The distribution is right-skewed: most opportunities have modest edge (3-5 pp), with a few high-edge opportunities that disproportionately drive returns.
Edge by Category
| Category | Markets Passing | Avg Edge (pp) | Notes |
|---|---|---|---|
| Politics | 12 | 8.4 | Alex's strongest category |
| Economics | 9 | 7.1 | Strong model performance |
| Sports | 4 | 4.2 | Limited model accuracy |
| Science/Tech | 6 | 5.8 | Moderate edge |
| Entertainment | 2 | 3.5 | Weakest category |
| Weather | 4 | 6.9 | Base rate driven |
| Geopolitics | 4 | 5.5 | Moderate edge |
Alex's edge is concentrated in politics and economics, consistent with his domain expertise. The model is weakest in entertainment and sports, where Alex has no informational advantage.
Part 4: Portfolio Construction
Capital Allocation
The 41 qualifying markets receive the following allocations:
Top 10 positions by allocated capital:
| Rank | Market | Category | Edge (pp) | Kelly (quarter) | Allocated |
|---|---|---|---|---|---|
| 1 | Fed rate decision Q3 | Economics | 14.2 | 8.1% | $5,000 |
| 2 | Governor race, State Y | Politics | 12.8 | 7.2% | $5,000 |
| 3 | Senate confirmation vote | Politics | 11.1 | 6.4% | $5,000 |
| 4 | GDP growth > 2.5% | Economics | 9.5 | 5.3% | $5,000 |
| 5 | Tech company IPO date | Science/Tech | 8.7 | 4.9% | $4,900 |
| 6 | Primary election, State Z | Politics | 8.3 | 4.6% | $4,600 |
| 7 | January temp record | Weather | 7.9 | 4.4% | $4,400 |
| 8 | Unemployment below 4% | Economics | 7.2 | 4.0% | $4,000 |
| 9 | Bill passage by EOY | Politics | 6.8 | 3.8% | $3,800 |
| 10 | Championship series winner | Sports | 6.1 | 3.4% | $3,400 |
Portfolio summary:
| Metric | Value |
|---|---|
| Number of positions | 41 |
| Total capital allocated | $38,400 |
| Percent of bankroll deployed | 38.4% |
| Average position size | $936 |
| Median position size | $780 |
| Largest position | $5,000 (5% cap) |
| Smallest position | $310 |
| Capital held in reserve | $61,600 |
Portfolio Diversification
| Category | Capital Allocated | % of Portfolio |
|---|---|---|
| Politics | $14,200 | 37.0% |
| Economics | $10,100 | 26.3% |
| Science/Tech | $4,900 | 12.8% |
| Weather | $3,800 | 9.9% |
| Geopolitics | $2,800 | 7.3% |
| Sports | $1,600 | 4.2% |
| Entertainment | $1,000 | 2.6% |
The portfolio is concentrated in politics and economics, reflecting Alex's comparative advantage. While this reduces diversification, it concentrates capital where edge is highest.
Part 5: Simulation Results
We simulate the portfolio's performance over 6 months using Monte Carlo methods (1,000 simulation runs). In each run, markets resolve randomly according to the "true" probabilities (which we set to be close to Alex's estimates, with some noise to reflect model imperfection).
Model Accuracy Assumptions
To make the simulation realistic, we assume Alex's model is imperfect:
- For each market, the "true" probability is drawn from a distribution centered on Alex's estimate with standard deviation $\sigma_q$ (the model's own uncertainty estimate)
- On average, Alex's estimates are biased slightly toward the market price (the model is not perfectly independent of market information)
- The model is better calibrated in politics/economics and worse in sports/entertainment
Monte Carlo Results (1,000 Runs)
| Metric | Value |
|---|---|
| Mean final bankroll | $106,840 |
| Median final bankroll | $106,200 |
| 5th percentile | $97,100 |
| 95th percentile | $117,800 |
| Mean return | +6.84% |
| Median return | +6.20% |
| Probability of profit | 87.3% |
| Probability of > 10% return | 28.4% |
| Probability of loss > 5% | 3.1% |
| Mean max drawdown | 5.2% |
| Worst-case max drawdown (95th percentile) | 11.8% |
| Sharpe ratio (annualized) | 1.42 |
Interpretation
The portfolio has an 87.3% probability of being profitable over 6 months, with a mean return of 6.84%. This corresponds to an annualized return of approximately 14%, which is attractive for a strategy with limited drawdown.
The 5th percentile outcome (losing about 3%) is manageable. The worst-case drawdown of 11.8% (at 95th percentile) is well within tolerance.
The Sharpe ratio of 1.42 is strong, comparable to top-tier quantitative hedge fund strategies. This is partly because prediction markets have returns that are less correlated with traditional financial markets, which reduces portfolio-level risk.
What Drives Returns
Decomposing the mean return across categories:
| Category | Contribution to Return | % of Total Return |
|---|---|---|
| Politics | +$3,180 | 46.4% |
| Economics | +$2,050 | 29.9% |
| Science/Tech | +$620 | 9.1% |
| Weather | +$450 | 6.6% |
| Geopolitics | +$320 | 4.7% |
| Sports | +$140 | 2.0% |
| Entertainment | +$80 | 1.2% |
Politics and economics together account for 76.3% of returns while using 63.3% of allocated capital. This confirms that Alex should concentrate on these categories.
The Impact of Filtering
To demonstrate the importance of edge filtering, we compare the full-portfolio strategy to alternatives:
| Strategy | Mean Return | Prob of Profit | Max Drawdown |
|---|---|---|---|
| Full pipeline (41 markets, edge > 3pp) | +6.84% | 87.3% | 5.2% |
| All 287 markets with any positive edge | +1.20% | 61.2% | 12.8% |
| All 500 markets (random) | -1.40% | 38.5% | 18.3% |
| Only top 10 by edge | +4.50% | 82.1% | 8.7% |
The full pipeline dramatically outperforms trading all markets. Trading all 500 markets randomly (no model) produces negative returns due to fees. Trading only the top 10 markets by edge gives good returns but with more concentration risk.
The edge filter is critical. Without it, the many marginal trades (with edge near zero or negative) dilute the portfolio's returns and increase risk.
Part 6: Sensitivity Analysis
Sensitivity to Kelly Fraction
| Kelly Fraction | Mean Return | Prob of Profit | Mean Max Drawdown |
|---|---|---|---|
| Full Kelly (1.0) | +18.5% | 89.1% | 22.4% |
| Three-quarter Kelly (0.75) | +15.2% | 88.8% | 16.1% |
| Half Kelly (0.50) | +11.1% | 88.5% | 10.2% |
| Quarter Kelly (0.25) | +6.84% | 87.3% | 5.2% |
| Tenth Kelly (0.10) | +2.8% | 83.1% | 2.1% |
Full Kelly offers the highest returns but with a 22.4% average drawdown. Quarter Kelly provides a good balance of returns and risk management. The probability of profit is fairly stable across fractions, but drawdown risk increases significantly at higher fractions.
Sensitivity to Edge Threshold
| Minimum Edge (pp) | Markets Passing | Mean Return | Sharpe Ratio |
|---|---|---|---|
| 0 (any positive) | 287 | +1.20% | 0.35 |
| 1 | 198 | +2.80% | 0.62 |
| 2 | 112 | +4.50% | 0.95 |
| 3 | 62 | +6.84% | 1.42 |
| 5 | 23 | +5.20% | 1.68 |
| 8 | 11 | +3.80% | 1.85 |
| 12 | 4 | +2.10% | 1.75 |
There is an optimal trade-off: very low thresholds include too many marginal trades (diluting returns), while very high thresholds exclude too many trades (limiting the number of bets and increasing concentration risk). For this model and universe, a threshold around 3-5 pp maximizes the Sharpe ratio.
Sensitivity to Model Accuracy
We vary the model's accuracy by adjusting how much noise is added to Alex's estimates:
| Model Quality | Mean Estimate Error | Mean Return | Prob of Profit |
|---|---|---|---|
| Excellent (low noise) | 3.0 pp | +10.2% | 92.1% |
| Good (baseline) | 5.0 pp | +6.84% | 87.3% |
| Fair (moderate noise) | 8.0 pp | +3.1% | 72.5% |
| Poor (high noise) | 12.0 pp | -0.5% | 48.2% |
| Random (no skill) | 20.0 pp | -2.8% | 32.1% |
Model quality is the primary driver of profitability. A mediocre model with high noise produces negative returns even with Kelly sizing and edge filtering. This underscores the importance of investing in model development and honest assessment of model accuracy.
Part 7: Practical Considerations
Execution Challenges
The simulation assumes perfect execution (trading at the observed market price). In reality:
- Slippage: Large orders move the price, especially in thin markets. Alex should break large orders into smaller pieces.
- Latency: By the time Alex acts on his model's output, the market may have moved. Real-time integration with market APIs reduces this risk.
- Market availability: Not all 500 markets may be available on a single platform. Cross-platform trading introduces additional complexity.
Correlation Risk
The simulation treats all markets as independent, but in reality:
- Political markets in the same election cycle are correlated
- Economic markets respond to common macro factors
- Multiple sports markets may depend on the same underlying team or player performance
Ignoring correlations leads to overexposure. Alex should group correlated markets and apply position limits at the group level, not just the individual market level.
Rebalancing
As markets resolve and new markets appear, the portfolio must be rebalanced: - Resolved markets free up capital for new opportunities - New markets should be scanned and added if they pass the edge filter - Existing positions should be reviewed if prices move significantly (edge may shrink or grow)
A weekly rebalancing cycle is practical for most prediction market traders. More active traders might rebalance daily.
Scaling Challenges
As bankroll grows, some markets become too small (insufficient liquidity) to absorb larger positions. Alex may need to: - Expand to additional platforms - Trade higher-liquidity markets (which tend to have smaller edges) - Accept lower returns per dollar as the strategy scales
Part 8: Key Takeaways
-
Systematic scanning dramatically outperforms discretionary trading. By evaluating 500 markets rather than relying on a handful of gut-feel trades, Alex identifies more opportunities and avoids selection bias.
-
Edge filtering is essential. Most markets do not offer meaningful edge. Trading everything dilutes returns. A 3-5 percentage point minimum edge threshold balances opportunity capture with quality.
-
Concentrate on your comparative advantage. Alex's edge is concentrated in politics and economics. Rather than trying to trade every category, he should focus where his model performs best.
-
Position sizing protects against model failure. Quarter Kelly with position caps limits drawdowns even when the model is wrong. This is especially important for a systematic strategy where errors can be correlated.
-
Model quality is the bottleneck. All the fancy portfolio construction in the world cannot compensate for poor probability estimates. Invest in model accuracy above all else.
-
The pipeline is more valuable than any single trade. The systematic process -- scanning, filtering, sizing, tracking -- generates consistent returns. No single trade matters much. What matters is the process repeated over hundreds of trades.
Code
The complete simulation code for this case study is available in code/case-study-code.py. It includes:
- Market universe generation
- Model simulation with configurable accuracy
- Edge scanning and filtering pipeline
- Kelly-based portfolio construction
- Monte Carlo portfolio simulation
- Sensitivity analysis across key parameters
- Visualization of results