Backtest & Calibration

Every forecast Hermes publishes is checked against history before you are asked to trust it. This page runs a walk-forward backtest, reports calibration, shows the reporting-lag correction the corpus implies, and measures how many spatial clusters DBSCAN would find on shuffled noise.

Reporting-lag-corrected volume forecast

raw JSON

Walk-forward calibration (50% and 95% prediction intervals)

If the model is honest, ~50% of actual months fall inside the dark band and ~95% inside the light band. Large deviations mean the intervals are either over-confident (too narrow) or sand-bagged (too wide).

raw JSON

Cluster-noise floor

Compare real DBSCAN clusters to the number found on the same points shuffled uniformly within the observed bounding box. A ratio above ~2x is a robust signal.
raw JSON

Forecasts describe report-volume behaviour (a function of observers, media, and seasonality), not UAP activity. Calibration metrics are walk-forward, mimicking real-time conditions. Spatial shuffling within the observed bounding box is a conservative null; population density inflates cluster counts in noise, so signal-to-noise above ~2.0 is where we treat a cluster as robust.