From Review to Forecasting

Real traders don’t just look back at trades. They project forward with sharper clarity, based on what their own edge is telling them.

Introduction

Read this first. This lesson uses forecasting in one specific sense only: projecting your own future performance metrics — winrate, expectancy, drawdown, losing-streak length — from your historical trade log. It does not teach market prediction. If you are looking for a way to predict next week's price, this lesson will not help you, and nothing in trading reliably will.

Forecasting in trading is the statistical projection of your own future performance distribution from your historical trade log. Given a 200-trade sample at 53% winrate, you can forecast that 8-trade losing streaks will occur roughly once per quarter. You cannot forecast next Tuesday's BTC range. Confuse these two and the rest of this lesson will hurt you.

Type	Input	Output	Falsifiable?	Useful for sizing?
Market forecast	Price/structure	Next price	Rarely	No
Self forecast	Your trade log	Your next 100-trade R-band	Yes (compare actual to forecast)	Yes
Setup ranking	Per-setup expectancy	Hypotheses to test	After more N	Indirectly

Prerequisites. You need a working trade feedback loop before you can forecast — without consistent review you have no clean sample to project from. The other anchor is base-rate literacy from the probability module; forecasting without it collapses into prediction.

What this lesson is not

This is not market forecasting. You cannot predict price. The Efficient Market Hypothesis is wrong in detail but right in spirit for retail timeframes: your edge does not come from knowing where price goes. It comes from knowing your own statistical signature — winrate, expectancy, drawdown distribution — and sizing accordingly. Confusing these two is the most expensive mistake in retail.

A practical consequence: every "forecast" in this lesson carries a sample size and a confidence interval. A claim without those is not a forecast — it is a guess.

Step 1: Extract base rates from your trade log

Look at your last 50–200 trades — but treat anything below N=50 as a hypothesis, not a forecast. With N=25 the 95% confidence interval on winrate is roughly ±20 percentage points. You are not forecasting yet; you are generating candidates to test on the next 100 trades.

1a. Compute per-setup winrate with confidence interval

For each setup, compute the Wilson 95% confidence interval on its winrate, not the naive ratio. The Wilson interval is the standard binomial-proportion CI and avoids the boundary problems of the naive Wald interval at small N. Treat the lower bound as your honest estimate.

1b. Compute per-setup expectancy in R

Expectancy per trade = (winrate × average win in R) − (loss rate × average loss in R). This is the only number that tells you whether the setup makes money on average. Setup ranking by raw winrate is the textbook small-sample error.

1c. Flag setups where N < 30 as hypotheses, not forecasts

Wilson 95% CI half-width on winrate shrinks with sample size

Diminishing returns: doubling N from 100 to 200 only narrows the band from 10pp to 7pp.

These questions surface which setups have produced positive expectancy in your sample — not where you will win next. With 20–30 trades the confidence interval on any single setup's winrate spans ±15–20pp; treat rankings as hypotheses, not forecasts.

Insight	What it actually tells you
What setup had the best R-multiple?	Candidate for more reps, but only if N ≥ 50
What setup caused the most frustration?	Behavioral signal — drop or reduce size, regardless of WR
Where did emotion most interfere?	Identify the trigger context; do not treat it as a forecast input
What structure repeats most often?	Sample-size pump — gives you faster CI convergence

Step 2: Build a forward trade bias map

The bias map is a plan, not a forecast. It tells you which hypotheses you are willing to risk capital on for the next 10–20 trades, and at what size. It is grounded in Step 1 numbers.

Area	Example	Numeric anchor
High-conviction setup	OB + BOS + imbalance confluence	N=72, WR 58% (CI 47–69%), expectancy +0.42R
Setup under review	Sweep without BOS	N=18, WR 39% — too small to act, hypothesis only
Avoid for now	Fade against HTF structure	N=24, expectancy −0.15R
Mental focus	Patience at open, no impulse trades	Plan-follow target ≥ 80%
Performance target	Plan-follow rate, not R-multiple	≥ 80% over next 20 trades

This becomes your trading compass for the next 10–20 trades. Each row is a falsifiable claim.

Step 3: Build a setup forecast scorecard

A forecast card is a setup spec card plus base rates. Without sample size and confidence interval, this is a setup spec — useful, but not a forecast. Add the fields a forecast actually requires.

Worked example — BTC OB Reclaim Long

LONGExample Trade

Entry

Limit at OB with wick buffer (sweep then BOS then OB retest)

Stop Loss

Below swing low or 1.2x ATR

Take Profit

40% at 2R, trail rest

R:R

0.30R expectancy / trade

HTF bias bullish (higher-low above support). Invalidation: no BOS after sweep means no entry. Sample drawn from Q4-2025 medium-volatility regime; re-validate if ATR percentile shifts more than 1 sigma.

N=47 trades, historical winrate 26/47 = 55.3% (Wilson 95% CI 41 to 69 percent). Average win +1.4R, average loss -1.0R. Expected max losing streak (95th percentile) ceil(log(0.05) / log(1 - WR)) approximately 4 trades; longest streak in 100 trades roughly 7 trades.

Expectancy per trade

(WR x avgWin) minus (LR x avgLoss), measured on the N=47 sample.

0.30R

E[R] = 0.553 * 1.4 + 0.447 * -1.0 = 0.30R

Expected max losing streak

95th percentile streak length at WR 55.3 percent. Plan-grade, not worst-case.

~4 trades

ceil(log(0.05) / log(1 - 0.553)) approximately 4

Longest streak in 100 trades

Plan for at least one streak this long over the next 100 trades. If your sizing cannot accept it, the sizing is wrong, not the setup.

~7 trades

Cumulative R over next 100 trades

Expectancy x trade count. The 1 sigma envelope is the band you should expect to live inside; outside the band, regime change is the working hypothesis.

~30R (1 sigma band ~+/-25R)

Field	Spec card (insufficient)	Forecast card (this lesson)
Entry / stop / partial logic	Yes	Yes
Sample size N	No	Required
Historical WR with 95% CI	No	Required
Expectancy in R	No	Required
Expected max losing streak	No	Required
Regime tag	No	Required

The scorecard becomes your repeatable forward-play instrument — not just a memory.

Weekly forecasting ritual

Each Sunday or Monday, run a checklist that produces numbers:

Recompute setup-level expectancy: (WR × avgWin) − (LR × avgLoss).
Recompute Wilson 95% CI on WR — if the lower bound is below 0% expectancy, the sample is too small to act on; demote to hypothesis.
Forecast next-week R-band as expectancy × planned trade count ± 2σ.
Pre-commit: trade only if executed plan-follow rate over the prior week was greater than 80%.
Tag the active regime (volatility percentile, trend / range / chop). If it shifts, mark forecasts as out-of-sample.

"I will take only 1 OB-reclaim setup per session. I will pre-plan entry/exit levels. I will not chase price outside my zone."

The ritual is not motivation — it is calibration. The forecast bands here become the expected envelope for equity curve analysis; when actual equity breaches the forecast band, regime change is the working hypothesis.

Frequently asked questions

How many trades do I need before I can forecast my performance?

At least 50 for a directional hypothesis, 100 for sizing-grade decisions, and 200+ for forecast-grade claims with a usable confidence interval. Below N=50 the Wilson 95% CI on winrate is wide enough (±14pp or worse) that ranking setups against each other is statistically meaningless.

Can I forecast which setups will keep working next month?

No. You can rank setups by historical expectancy and size positions proportional to confidence — that is not the same as predicting they will continue to work. Setups decay when the regime that supported them shifts; the forecast is a null hypothesis to be falsified by next month's data, not a guarantee.

What if my last 20 trades all worked?

Recency bias plus small-sample noise. With N=20, even a 70% observed winrate has a Wilson lower bound near 46%. Twenty consecutive winners is consistent with a true 50% winrate over a long enough run. Do not increase size on the basis of a 20-trade stretch.

What goes on a setup forecast card that a setup spec card lacks?

Sample size N, historical winrate with 95% confidence interval, expectancy in R, expected maximum losing streak, and a regime tag. A spec card without these is a plan — useful, but it cannot tell you whether the setup is worth the size you are putting on it.

Closing

Your trade log is a sample from a distribution. The job is to estimate that distribution well enough to size correctly and stop early when it shifts. That is the only forecast worth making — and it has nothing to do with predicting price.

Use the past to bound expectation, not to predict it. The gap between forecast R and realized R is then decomposed in the next lesson via MAE/MFE slippage.

Next in this module

Lesson 5 of 11 — Measuring Slippage with MAE/MFE: now that you can forecast your own R-distribution, the next step is to attribute the gap between forecast and reality to slippage and execution drag.