Concept · The simulator

Tick Data

Individual trade-level records from an exchange — every single transaction, timestamped to the millisecond — as opposed to the aggregated OHLCV (open, high, low, close, volume) candles used in most backtesting engines.

Tick Data

In plain English

A standard 1h candle gives you five numbers: the price at open, the highest price touched, the lowest price touched, the closing price, and total volume. A lot happened inside that hour — thousands of individual trades — but the candle collapses them all into five numbers.

Tick data keeps every trade. For a 1h window on BTC perps, that might be 50,000–200,000 individual records, each with a timestamp, price, and size. Binance publishes this as aggregate trade data — trades at the same price and millisecond are compressed into one row, reducing the count but preserving timing.

What tick data resolves that OHLC cannot

1. TP/SL simultaneous-trigger resolution

Take-profit (TP — a preset price to auto-close in profit) and stop-loss (SL — a preset price to auto-close in loss) can both fall inside a single candle's high-low range. A 1h candle that spans 0.8% has both a high and a low that may each trigger one of them. OHLC data cannot tell you which happened first — the engine must guess. Tick data resolves this exactly.

2. Limit-order fill timing

The OHLC-range fill check (see limit order) tells you whether the candle's low touched your limit price, not when. Tick data gives the exact millisecond. Queue position (how many other orders were ahead of you) is still unknowable from public trade data — but timing is precise.

3. Intra-candle liquidation sequencing

At high leverage, the exact sequence of price ticks determines whether a liquidation cascades to other positions. OHLC only gives the per-candle extremes.

What tick data does NOT resolve

Realistic slippage requires order book depth (L2), not tick data. This is a common misconception. Tick data shows the price path — every price at which a trade occurred. It does not show how much liquidity existed at each level. To model "I tried to buy 5 BTC and consumed the book up 0.15%", you need L2 snapshots: the full bid/ask ladder at every point in time. See slippage.

Tick data without L2 tells you the price visited your limit. It does not tell you how much of your order filled before price moved on.

Storage cost at 6 years, 20 symbols

Binance's aggregate trade data uses approximately 45 bytes per row (agg_trade_id, price, qty, first/last trade ID, timestamp, side flag).

Symbol	Trades/day (agg)	6-year rows	Raw size
BTC	~1M	~2.2B	~100 GB
ETH	~500k	~1.1B	~50 GB
SOL (~4 years active)	~300k	~440M	~20 GB
17 other alts	~200k each	~260M each	~12 GB each
Total		~4B rows	~370 GB raw

In Postgres with indexes: approximately 1 TB.

Comparison:

Current 1h OHLCV candles (20 symbols, 6 years): ~100 MB
1m OHLCV candles: ~5–6 GB
Tick data: ~1 TB — 10,000× larger than 1h candles

Compute cost

The replay engine iterates through events per strategy per symbol. Current 1h replay:

6 years × 365 × 24 = 52,560 candles per symbol

With tick data on BTC:

6 years × 365 × ~1M agg trades/day = ~2.2 billion events per symbol

That is a ~42,000× multiplier per symbol over current 1h replay. For 1,500 strategies × 20 symbols, tick-level replay in Postgres would take weeks per full fleet run. Viable only with a specialized columnar time-series engine (ClickHouse, QuestDB, TimescaleDB) and parallelized compute.

Is the precision worth it for this fleet?

For 1h and 4h swing strategies — currently the only strategies with measured edge in this fleet — mostly no. A swing strategy hunting a 2–10% move does not gain meaningful fidelity from knowing its 1h-candle entry happened at 14:32 vs 14:58. The fill timing ambiguity is small relative to the target magnitude.

Tick data becomes valuable when:

Testing sub-1m strategies (scalpers targeting 0.05–0.2%)
TP/SL are set tight enough to frequently both land inside the same candle
Liquidation sequencing is a core modeling concern

The practical intermediate path: 1-minute sub-bars

For the specific TP/SL simultaneous-trigger problem, 1-minute candles are a tractable middle ground:

60× more data than 1h candles (not 42,000×)
TP and SL that both fall inside a single 1h candle are almost always in different 1m candles — the trigger order becomes unambiguous
Standard OHLCV data, no order book required
~6 GB total for 20 symbols over 6 years
Implementation: signal evaluation stays on 1h candles; fill resolution runs a sub-pass on 1m data when both TP and SL are within the same parent candle's range

This delivers ~95% of the TP/SL resolution benefit of full tick data at ~0.01% of the storage cost.

simulator fidelity
slippage
limit order
take profit stop loss

Sources

wiki/qa-sessions/2026-06-29-session.md#q4 (first asked here)
apps/backend/src/evaluation/position/fill-resolver.ts (current OHLC fill model)

Related concepts

See it in a real result →

Put it to the test

Does your idea have a real edge, or just a big number?

Spawn your variant, run it on the same engine, and read the edge-significance verdict — before you risk real money.

Test your own idea — free →Free account, no card