Applied AI / Agent Lab Book a Discovery Session
07
Section 07 · Model and markets

Model and markets,
side by side

Probability tables alone are commodity content. The distinctive thing is publishing the model's read next to two independent markets — Polymarket and Kalshi — ahead of time, then scoring each one honestly as the tournament plays out. There is no single market reference, so we never blend the venues into a consensus. This page is the dated bet; the calibration scorecard is where it gets settled.

Teams compared
48
Champion market overlap
Model favourite
Spain
13.3% to lift
Polymarket favourite
Spain
16.6% implied
Kalshi favourite
Spain
16.2% implied
Reading the disagreement

Model and both markets agree at the top of the field — all have Spain as the favourite. Most of the action is therefore further down the table. Mean absolute gap across the 48 compared teams is 0.88 pp (averaged over venues — a navigational figure, not a pooled price). The pairs below isolate the biggest gaps in each direction.

Model more bullish

We rate higher than the markets

Team Model Mean Δ
Market more bullish

Markets rate higher than us

Team Model Mean Δ

All compared teams

Snapshot 2026-05-27
Team Model Polymarket Kalshi
implied Δ implied Δ

We de-vig each venue separately by proportional normalization (pi = yesi / Σ yesj). This is simple and transparent, but it assumes the overround is spread proportionally across teams. Δ is the model's champion probability minus that venue's de-vigged implied, in percentage points.

In-tournament scoring

Tournament scoreboard starts 2026-06-11

Once matches start, every fixture gets a pre-match probability from us and from each market venue's closing line. Outcomes are then scored under a proper rule (RPS), per venue, and the cumulative margins published here. Champion futures are scored separately, and only resolve once the tournament ends.

The point is not bragging rights — it's to make the comparison falsifiable on a public timeline. If the model loses to a venue, the page says so. The calibration scorecard tracks the related "when we say 60%, does it happen 60% of the time?" question.

Sources · Polymarket WC 2026 champion market ↗ Kalshi WC 2026 champion market ↗