Overview
Tick-by-tick order book reconstructions from websocket feeds and REST data. Each row represents the full order book state at that timestamp. The order book is reconstructed from multiple redundant websockets to ensure zero data loss. Three separate datastreams are merged:price_change messages, book
messages, and a scheduled download of all books from the REST API every 5 minutes.
This way in the highly unlikely event the book becomes slightly out of sync,
every 5 minutes it is guaranteed to be correct.
If you plan on downloading high volumes of order book data, please reach out! We
support bulk parquet exports which will allow you to avoid parsing text. Email:
calder@predictiondata.dev.
Columns
Column | Type | Description |
|---|---|---|
exchange_timestamp | integer | Exchange timestamp in milliseconds |
local_timestamp | integer | Local capture timestamp in milliseconds (UTC) |
ask_prices | string | Comma-separated ask prices (lowest to highest) |
ask_sizes | string | Comma-separated ask sizes at each price level |
bid_prices | string | Comma-separated bid prices (highest to lowest) |
bid_sizes | string | Comma-separated bid sizes at each price level |
Notes
- Date format:
YYYY-MM-DD - Prices and sizes are comma separated string for easy parsing
- Book snapshots merge data from multiple websockets to ensure completeness