Edge Degradation | Trading Glass

No strategy lasts forever. Here's how to spot when your edge is fading — before your account does.

Prereq: Biases in Backtesting — without it you'll mistake an overfit strategy for a decaying one. Next: Outliers and Their Impact on Metrics — a single tail trade can fake a regime shift in your stability score.

TL;DR

Edge degradation is a statistically detectable decline in a strategy's expected value, caused by crowding, regime change, structural market change, or leakage discovery. Distinguish it from normal variance using a CUSUM chart or rolling t-test on per-trade R. When detected, choose between retire, refit, or wait based on the cause — not on emotion.

Introduction

Every system has a lifecycle.

It works great for months
Then it flatlines
Then the losses start piling up

Most traders:

Blame psychology
Start second-guessing
Or abandon it entirely (often too early or too late)

But smart traders detect edge degradation statistically — and adapt before a full breakdown.

This post shows you how.

What Is Edge Degradation?

Edge degradation means your once-profitable strategy:

No longer has positive EV
Underperforms in current market conditions
Has changed in win rate, R:R, or risk profile

Not due to a few random losses — but due to a statistical shift in the system’s performance.

Why Edges Decay (the mechanism, not the metaphor)

Markets are competitive systems. Any signal that produces excess return is a target. The decay process is information diffusion: one trader finds an inefficiency → a desk replicates it → an academic paper publishes the factor → a retail platform builds a screener for it → the original alpha is now priced in by 10,000 participants front-running each other into the trade. McLean & Pontiff (2016) measured this explicitly: published equity factor returns drop ~58% after publication. Your edge has the same fate unless you find one nobody else is hunting.

Published factor return drop post-publication

McLean & Pontiff (2016). The headline crowding-decay benchmark — once a factor is published, roughly three-fifths of its excess return vanishes as capital crowds in.

-58%

Four distinct decay mechanisms:

Crowding (slow, monotonic). More capital chases the same alpha; spread compresses. McLean & Pontiff (2016) found published factor returns drop ~58% post-publication.
Regime change (fast, often reversible). Vol regime flips: a mean-reversion strategy that prints in low-vol chop dies the day realized vol doubles.
Structural change (permanent). Tick-size reform, maker-taker fee changes, exchange microstructure updates — your 2019 backtest assumed a market that no longer exists.
Leakage discovered (instant). You find a look-ahead bug; the "edge" was never real.

Great traders don't just build edge — they monitor it continuously.

How do you detect edge degradation?

1. Track Rolling Metrics

Instead of only checking lifetime stats, track:

Rolling EV (e.g. last 30 or 50 trades)
Rolling win rate
Rolling average R:R
Rolling drawdown

Plot these over time to see statistical trends.

"My average EV dropped from +0.6R to +0.2R over the last 100 trades" → Time to slow down and re-evaluate.

2. Use Visual Breakpoints

Graph your PnL or equity curve.

Look for:

Plateaus
Trend shifts (up → flat → down)
New high-water marks taking much longer to hit
Increased variance with fewer new highs

These are often the early symptoms of a fading edge.

3. How do you tell if a strategy works only in one regime?

Segment your trades by:

Trend vs rangebound days
High vs low volatility
Market sessions (NY, Asia, London)
Crypto vs FX vs indices

You might find:

"This strategy worked well in high-volatility BTC days… but fails during chop."

Now you can filter or evolve it, not abandon it blindly.

How do you tell variance from edge decay?

Loss streak ≠ broken system. That's just normal distribution noise — see Outliers and Their Impact on Metrics for why a single tail trade can also masquerade as a regime change. And before trusting your in-sample numbers at all, audit your backtest for the failure modes covered in Biases in Backtesting.

A 55% win rate strategy will produce a 10-trade losing streak somewhere in 200 trades roughly 30% of the time. A 12-trade streak: ~18%. If you retire your system every time you see one, you will retire winning systems and keep replacing them with new untested ones — the trader's equivalent of shooting your own dog because it barked. Run the test before you reach for the trigger.

Long losing streaks are normal variance, not edge decay

At a 55% win rate, even a 12-trade losing streak shows up almost one time in five over a 200-trade window. Retiring on a streak retires winning systems.

Use a test, not a vibe:

CUSUM chart on rolling per-trade R: triggers when the cumulative deviation from in-sample mean breaches a threshold calibrated to your false-positive rate (Page 1954).
Welch's t-test comparing the last 50 trades' mean R against the prior 200. p < 0.05 is suggestive; p < 0.01 with effect size > 0.3σ is actionable.
EWMA control chart if you want faster detection at the cost of more false alarms.

Without a pre-committed test, you will either capitulate in week 3 of a normal drawdown or bag-hold a dead strategy for a year.

Signal	Variance	Decay
Loss streak length	Within bootstrap 95% CI of historical streaks	Beyond 99% CI
Rolling 50-trade EV	Oscillates around in-sample mean	Trends down monotonically over 100+ trades
Drawdown	< 1.5x worst in-sample	> 2x worst in-sample
Regime explanation	Performance constant across vol regimes	Performance bifurcates cleanly by regime
Welch's t-test (last 50 vs prior 200)	p > 0.10	p < 0.05

You're likely in edge degradation — and must adapt.

How to Respond to Edge Degradation

1. Reduce size (not confidence) temporarily

Cut size in proportion to your loss of confidence in EV. If your prior was +0.6R and rolling 50-trade EV is now +0.2R, you have ~1/3 the edge and Kelly says ~1/3 the size. Do this before a forced cut from drawdown — voluntary de-risking preserves both capital and the ability to think clearly.

2. Reassess performance by market regime

Don’t scrap everything. Instead:

Tag trades by volatility/trend/direction/time
Look for where edge still exists
Rebuild around what still works

3. Test small adjustments in isolation

Try:

Adjusting entries (delay confirmation?)
Changing exit logic (partial vs full?)
Adding filters (volume, time, range?)

Log these as separate strategies until they’re proven

4. Stop treating your system like a statue

Think of your system as:

A living framework you periodically tune — not a sacred formula you never touch.

Edge Stability Score

Rolling metrics and equity curve analysis tell you that something is changing, but the Edge Stability Score quantifies how consistently your edge has performed across the life of your trade log.

How It Works

Split your trades into 5 equal segments (chronological order). If you have 200 trades, each segment contains 40 trades.
Calculate the mean R per trade for each segment.
Compare the segment means to each other. The score measures how tightly clustered the five means are relative to your overall mean R.

The formula normalizes the standard deviation of segment means against the overall mean:

Edge Stability Score = 1 − (sigma_segments / |mu_R|)

sigma_segments = standard deviation of the 5 segment mean R values|mu_R| = absolute value of overall mean R across all tradesclamp = result clamped to the interval [0, 1]

This is an ad-hoc inverse coefficient-of-variation — useful as a triage flag, not a hypothesis test. The 0.85 / 0.65 / 0.50 cutoffs are heuristic thresholds tuned for ~200-trade logs; they have no closed-form distributional meaning. For inference, pair this with a CUSUM or t-test on the same segments.

Interpreting the Score

Edge Stability Score	Interpretation
0.85 - 1.0	Excellent. Your edge is remarkably consistent across time periods.
0.65 - 0.84	Good. Normal variation exists but edge persists throughout.
0.50 - 0.64	Marginal. Performance is noticeably uneven. Investigate which segments underperform and why.
Below 0.50	Concerning. Your "edge" may be concentrated in one or two favorable periods rather than being a true, durable advantage.

What a Low Stability Score Reveals

A stability score below 0.50 typically means one of the following:

Regime dependency -- your strategy only works in specific market conditions (trending, high volatility, etc.) and you are trading it indiscriminately.
Recency illusion -- a strong recent run is masking poor earlier performance, inflating your lifetime EV.
Curve-fit risk -- if you optimized parameters on a particular period, only that segment will look good.
One-off outlier -- a single outsized winner in one segment can pull its mean up and distort overall stats.

What to Do When Stability Is Low

Segment by regime, not just time. Tag each trade with market condition (trending, ranging, volatile, quiet). Recalculate stability within each regime to find where your edge actually lives.
Increase your segment count. Try 8 or 10 segments instead of 5. If the score improves, the instability was driven by one anomalous period. If it stays low, the inconsistency is structural.
Remove outlier trades and retest. Drop your top 2-3 winners and bottom 2-3 losers, then recalculate. If stability jumps significantly, your edge depends on rare events rather than repeatable execution.
Restrict your strategy. If only 2 of 5 segments are profitable, define the conditions those segments share and only trade when those conditions are present. A filtered strategy with a 0.85 stability score is far more trustworthy than an unfiltered one at 0.40.
Re-evaluate position sizing. Low stability means your risk of ruin is higher than your EV suggests. Reduce size until you can demonstrate consistency across multiple segments.

Edge Stability pairs powerfully with rolling EV and drawdown analysis. Two honest caveats: (1) with 200 trades, a Welch's t-test has low power to detect a 30% EV drop — you will miss real decay. (2) A high stability score on a curve-fit backtest means nothing; the optimizer was rewarded for producing exactly that. Stability scores are diagnostic on live trades, not on backtests.

Retire, Refit, or Wait?

Diagnosis	Statistical signal	Action
Normal variance	t-test p > 0.10, drawdown < 1.5x in-sample max	Wait. Do nothing.
Regime-attributable decay	Performance bifurcates cleanly by a regime tag (vol, trend)	Refit: add the regime as a filter, retest out-of-sample.
Crowding / publication	Slow monotonic decline over 200+ trades, no regime explains it	Retire. Crowding rarely reverses.
Structural change	Sharp break tied to a known event (fee change, listing, microstructure update)	Retire or rebuild from scratch — the old assumptions are dead.
Leakage discovered	Look-ahead or survivorship bug found in code	Retire immediately. The edge was never real.

Interactive: Equity Curve Under Edge Decay

Use the simulator to explore how different win rates and payoff ratios produce different equity shapes. An edge that is decaying will show the curve bending downward over time — compare high and low win rates to see this effect.

Equity Curve Simulator

Win Rate: 55%Payoff: 1.5:1

Final: $34281 (+242.8%)

FAQ

What is edge degradation in trading?

Edge degradation is a statistically detectable decline in a strategy's expected value. The strategy no longer has positive EV, underperforms current market conditions, or shows a measurable change in win rate, R:R, or risk profile — not from a few random losses, but from a real shift in the system's performance distribution.

How is edge decay different from a normal losing streak?

A losing streak is variance — a 55% win rate strategy will produce a 10-trade losing streak somewhere in 200 trades roughly 30% of the time. Edge decay is a sustained statistical shift: rolling 50-trade EV trends down monotonically, drawdown exceeds 2x your in-sample max, and a Welch's t-test of recent vs prior trades returns p < 0.05.

What should you do when your strategy starts losing its edge?

Diagnose the cause first, then decide. Normal variance: wait. Regime-attributable decay: refit with a regime filter and retest out-of-sample. Crowding or structural change: retire. Always cut size voluntarily in proportion to your loss of EV confidence before drawdown forces you to.

Final Thought

Strategies don't die -- they evolve, or they erode.

Your edge is not guaranteed forever. But with disciplined tracking and objective review, you can:

Extend its lifecycle
Avoid emotional quitting
Rebuild around truth, not fear

Don’t just trade the market. Trade your data.