Data sources at a glance

Summary + sample · full document is 1,288 words

Summary

A quick-reference table of every data source, what it covers, and how accessible it is. Companion to the full narrative in "Where our data comes from".

Costs are approximate, as of mid-2026; treat them as order-of-magnitude estimates. Access difficulty is scored from 1 to 5: 1 = "git clone, you're done"; 5 = "vendor partnership required."

Sample

Results

| Source | What it covers | Cost | Access difficulty | Notes |
|---|---|---|---|---|
| Football-Data.co.uk | Top European leagues 1993–present, results | Free | 1 | The starter dataset; CSV per league per season. |
| FBref (Sports Reference) | Per-match per-player aggregates from Opta/StatsPerform, broad historical | Free (rate-limited scrape) | 2 | Aggregates only, not raw events. |
| Wikipedia / Wikidata | International tournaments, historical seasons | Free | 1 | Structured back-fill source for pre-2000 data and national-team results. |
| ClubElo | Per-club Elo time-series, free CSVs | Free | 1 | Useful team-strength prior. |
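
As a rough illustration of what "CSV per league per season" means in practice, the sketch below parses one season file in the Football-Data.co.uk layout using only the Python standard library. It assumes the commonly published column names (`Date`, `HomeTeam`, `AwayTeam`, `FTHG`, `FTAG`, `FTR`); the rows are made-up placeholders rather than real results, and the helper names `read_results` and `result_counts` are hypothetical.

```python
import csv
import io
from collections import Counter

# Illustrative rows only (made-up teams and scores), laid out with the
# column names assumed for Football-Data.co.uk full-time results.
SAMPLE_CSV = """Date,HomeTeam,AwayTeam,FTHG,FTAG,FTR
01/08/2025,Team A,Team B,2,1,H
02/08/2025,Team C,Team D,0,0,D
03/08/2025,Team B,Team C,1,3,A
"""


def read_results(text):
    """Parse season CSV text into a list of dicts with integer goal counts."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        rows.append({
            "date": row["Date"],
            "home": row["HomeTeam"],
            "away": row["AwayTeam"],
            "home_goals": int(row["FTHG"]),
            "away_goals": int(row["FTAG"]),
            "result": row["FTR"],  # H = home win, D = draw, A = away win
        })
    return rows


def result_counts(rows):
    """Count home wins, draws, and away wins across the parsed rows."""
    return Counter(r["result"] for r in rows)


results = read_results(SAMPLE_CSV)
counts = result_counts(results)
print(counts["H"], counts["D"], counts["A"])
```

For a real season file, the same function could be fed the file's text instead of the inline sample. Real files carry many more columns (odds, shots, cards), which `csv.DictReader` ignores unless they are read explicitly.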
