28 June 2026 · edwin-chan

Group stage scorecard: grading all 72 predictions

The group stage is done and every prediction is on the record. Across 72 matches, the model averaged a Brier score of 0.516, with 57% of predictions graded A or A+. Group I (France's group) and Group J (Argentina's group) were the easiest to read. Group H (Spain, Uruguay, Cape Verde) was the hardest, anchored by the Spain 0-0 Cape Verde opener. Here is the full scorecard, group by group.

Every prediction we published for the group stage is now on the record. Seventy-two matches, all scored, all graded. This is the full accountability report.

We grade each prediction using Brier scores. Lower is better. An A+ means the model called it almost perfectly (Brier below 0.20). An F means the model was badly wrong (Brier above 1.60). The full methodology explains the scoring in detail, and the interactive report card lets you explore match by match.

The headline numbers

MetricValue
Total matches72
Mean Brier score0.516
A+ grades23 (32%)
A grades18 (25%)
B grades22 (31%)
C grades8 (11%)
D grades1 (1%)
F grades0 (0%)

More than half of all predictions earned an A or A+. None received an F. The single D was Spain 0-0 Cape Verde on Match Day 1, where the model assigned just 10.5% to the draw.

Draws were the model's weakness. Twenty of 72 matches ended in draws (27.8%), but the model's average draw probability was 23.2%. That 5-point gap between prediction and reality drove most of the C and D grades.

The model's favourite won 43 of 72 matches (59.7%). Underdogs won outright nine times (12.5%). The remaining 20 were draws.

Group rankings by mean Brier

RankGroupMean BrierA+/ABCD/F
1I (France, Senegal, Norway, Iraq)0.2625100
2C (Brazil, Morocco, Haiti, Scotland)0.2715100
3J (Argentina, Algeria, Austria, Jordan)0.2755100
4L (England, Croatia, Ghana, Panama)0.4744110
5F (Netherlands, Japan, Sweden, Tunisia)0.4794200
6K (Portugal, DR Congo, Colombia, Uzbekistan)0.5333210
7B (Canada, Bosnia, Qatar, Switzerland)0.5434110
8E (Germany, Ivory Coast, Ecuador, Curaçao)0.5893120
9A (Mexico, South Africa, South Korea, Czech Republic)0.6223210
10G (Belgium, Egypt, Iran, New Zealand)0.6662400
11D (USA, Australia, Paraguay, Turkey)0.6761500
12H (Spain, Cape Verde, Uruguay, Saudi Arabia)0.8042121

Three groups stand out. Groups I, C, and J all scored below 0.30, meaning the model read nearly every match correctly. At the other end, Group H was the only group above 0.70. The gap between the best and worst group (0.262 vs 0.804) is wide enough to tell a story about where the model works and where it breaks down.

The best groups

Group I: France, Senegal, Norway, Iraq (0.262)

The model's best group. Five of six predictions earned A or A+, and the worst call (Senegal 2-3 Norway, Brier 0.546) was still a B. France won all three matches as projected. Norway finishing second was well-calibrated. The model had a clear read on the power structure in this group.

MatchResultP(result)BrierGrade
France vs Senegal3-157.4%0.276A
Iraq vs Norway1-467.7%0.166A+
France vs Iraq3-079.3%0.073A+
Senegal vs Norway2-339.8%0.546B
France vs Norway4-156.3%0.287A
Senegal vs Iraq5-062.4%0.224A

Group C: Brazil, Morocco, Haiti, Scotland (0.271)

Four A+ grades out of six. Brazil beat Haiti (Brier 0.015, the best single prediction of the entire tournament) and Scotland. Morocco beat Scotland and Haiti. The two non-A+ results were Brazil 1-1 Morocco (Brier 0.847, a B), where the model had Brazil as favourites and the draw surprised, and Morocco 1-0 Scotland (Brier 0.396, an A), where the model had Morocco as slight favourites and was nearly right.

MatchResultP(result)BrierGrade
Brazil vs Morocco1-127.7%0.847B
Haiti vs Scotland0-168.6%0.152A+
Brazil vs Haiti3-090.8%0.015A+
Morocco vs Scotland1-048.7%0.396A
Brazil vs Scotland3-071.1%0.130A+
Morocco vs Haiti4-277.6%0.083A+

Group J: Argentina, Algeria, Austria, Jordan (0.275)

Argentina winning all three matches was projected at high confidence, and the model earned four A+ grades. The only blemish was Algeria 3-3 Austria (Brier 0.816, a B), a wild six-goal draw on Match Day 3 where the model slightly favoured Algeria to win outright.

MatchResultP(result)BrierGrade
Argentina vs Algeria3-069.6%0.149A+
Austria vs Jordan3-164.0%0.196A+
Argentina vs Austria2-064.5%0.198A+
Algeria vs Jordan2-159.3%0.249A
Argentina vs Jordan3-184.9%0.040A+
Algeria vs Austria3-326.6%0.816B

The worst group

Group H: Spain, Cape Verde, Uruguay, Saudi Arabia (0.804)

The model's hardest group by a wide margin. Spain 0-0 Cape Verde on Match Day 1 was the worst prediction of the tournament (Brier 1.572, the only D grade). The model gave that draw a 10.5% probability. Cape Verde 2-2 Uruguay (Brier 1.151, a C) and Saudi Arabia 1-1 Uruguay (Brier 1.073, a C) compounded the problem. The model treated Uruguay as a strong contender and Cape Verde as the group's weakest team. In reality, Uruguay finished with just 2 points and Cape Verde nearly qualified.

MatchResultP(result)BrierGrade
Spain vs Cape Verde0-010.5%1.572D
Saudi Arabia vs Uruguay1-123.4%1.073C
Spain vs Saudi Arabia4-085.4%0.037A+
Cape Verde vs Uruguay2-221.4%1.151C
Spain vs Uruguay1-059.9%0.247A
Cape Verde vs Saudi Arabia0-029.6%0.744B

The lesson from Group H is about draws. Four of six matches were decided (or drawn) by a single goal, and three of those were draws. When a group produces more draws than expected, the model's probabilities are too spread across win/loss outcomes and the Brier penalty compounds.

The middle groups

A few groups deserve brief notes.

Group D (USA, Australia, Paraguay, Turkey) had the second-worst mean Brier (0.676) despite no C, D, or F grades. Five of six matches were B grades: results that were plausible but not confidently predicted. The USA winning the group was expected, but their margins of victory (4-1 vs Paraguay, 2-0 vs Australia) were larger than the model projected, and Australia finishing second ahead of Paraguay was an outcome the model saw as close to a coin flip. The USA losing to Turkey 2-3 on Match Day 3 was the group's only A grade.

Group G (Belgium, Egypt, Iran, New Zealand) was similar: four B grades, reflecting a group where the model had a reasonable read on the hierarchy but couldn't separate the outcomes with confidence. Belgium starting slowly (1-1 draw with Egypt, 0-0 draw with Iran) dragged the scores up before their 5-1 win over New Zealand (Brier 0.073, A+) restored order.

Group E (Germany, Ivory Coast, Ecuador, Curaçao) contained both extremes: Germany 7-1 Curaçao (Brier 0.018, the second-best prediction of the tournament) and Ivory Coast 1-0 Ecuador (Brier 1.058, a C where the model had the underdogs at just 17.2%).

The ten biggest misses

MatchResultP(result)BrierGrade
Spain vs Cape Verde0-010.5%1.572D
Curaçao vs Ecuador0-015.0%1.397C
Qatar vs Switzerland1-115.4%1.353C
England vs Ghana0-016.8%1.322C
Portugal vs DR Congo1-119.3%1.220C
Cape Verde vs Uruguay2-221.4%1.151C
South Africa vs South Korea1-016.6%1.093C
Saudi Arabia vs Uruguay1-123.4%1.073C
Ivory Coast vs Ecuador1-017.2%1.058C
Iran vs New Zealand2-224.5%0.982B

Seven of the ten biggest misses were draws. The other three were underdog wins where the model assigned them less than 20% probability. The pattern is clear: when the model is most wrong, it is usually because it underestimated the draw.

The ten best calls

MatchResultP(result)BrierGrade
Brazil vs Haiti3-090.8%0.015A+
Germany vs Curaçao7-190.0%0.018A+
Spain vs Saudi Arabia4-085.4%0.037A+
Argentina vs Jordan3-184.9%0.040A+
France vs Iraq3-079.3%0.073A+
Belgium vs New Zealand5-179.0%0.073A+
England vs Panama2-078.8%0.074A+
Morocco vs Haiti4-277.6%0.083A+
Portugal vs Uzbekistan5-071.7%0.128A+
Curaçao vs Ivory Coast0-271.5%0.129A+

The best calls share a pattern: strong favourites winning comfortably. When the model assigns a probability above 70%, it is almost always right. The top ten best calls all had pre-match probabilities above 71%. The model's strength is reading power gaps; its weakness is reading competitive matches where draws are likelier than the model expects.

What this means for the knockouts

The group stage exposed one systematic weakness: draw calibration. The model predicted draws at 23.2% on average, but draws happened 27.8% of the time. That gap matters less in the knockout rounds, where draws lead to extra time and penalties rather than a final result. The model's core strength (reading which team is stronger and by how much) translates directly to the single-elimination format.

The full interactive scorecard is on the report card page. Every match, every grade, every probability.


Brier scores are computed from predictions frozen at least 24 hours before kickoff (the "receipt" system). Grades follow the thresholds on the report card page: A+ (below 0.20), A (below 0.45), B (below 1.00), C (below 1.40), D (below 1.60), F (1.60 and above). The model publishes probabilities, not recommendations. Full methodology. Full Terms of Use.

See the live forecast

This note draws on the same calibrated model that powers the full 2026 World Cup forecast — win probabilities for every fixture, projected line-ups, and the tournament-winner picture, refreshed on every run.

Explore the forecast →

See how the forecast holds up

OnThePitch grades every probability against what actually happens, and publishes the calibration in the open. Subscribe to follow the retrospectives and research as they ship — one email per post, no marketing.

Already read on Substack? Follow OnThePitch there — same posts, slightly different format.

2,006 kelime · yayın tarihi: 28 June 2026

#- world-cup-2026