연구

예측은 어떻게 만들어지는가 — 그리고 왜 신뢰할 수 있는가

모든 확률은 출시 전에 8×90일 워크포워드 게이트로 백테스트되며, 대회 기간 동안 실제 결과에 대해 실시간으로 평가됩니다 — 방법론, 백테스트, 한계 모두 공개되며, 실패한 실험도 포함됩니다.

50편의 짧은 글, 22편의 방법론 문서, 25편의 연구 및 백테스트 노트.

3개 독립 모델의 평균
8×90일 워크포워드 게이트
실패한 실험도 게시
공개 데이터만으로 구축

이 수치를 신뢰할 수 있을까요?

검증받기 위해 만들어졌습니다

공개된 확률을 진지하게 받아들여야 하는지를 결정하는 것: 실제 결과에 대한 검증 성적, 성공과 함께 공개되는 실패 기록, 그리고 모든 수치 뒤에 있는 버전 관리된 기록.

전체 논증 · 무료

Why trust these numbers

A probability publication is a credibility game. Anyone can publish numbers; the question is whether those numbers track outcomes once the matches finish. This page collects the d…

실시간 보정 추적기

수치가 결과를 추적하고 있는가?

Brier score와 등급별 보정 결과를 실제 결과에 대해 평가하며, 대회 기간 내내 업데이트됩니다. 70%로 평가된 결과는 약 70%의 확률로 발생해야 합니다 — 여기서 확인하세요.

부정적 결과

실패한 실험들

게이트를 통과하지 못한 모든 모델 변형을 판정과 함께 전문 공개합니다. 미채택 결과는 성공만큼 투명하게 공개됩니다.

모델 변경 로그

모든 버전을 기록으로

모델의 버전 관리된 이력 — 모든 재훈련과 아키텍처 변경이 출시 시점의 Brier score와 함께 기록되며, 전체 노트로 연결됩니다. 각 페이지의 수치는 여기의 날짜가 기록된 행으로 추적할 수 있습니다.

방법론 핵심

여기서 시작하기

모델의 작동 방식을 알고 싶다면 먼저 읽어야 할 세 가지 문서입니다. 전문을 무료로 읽을 수 있습니다.

How we make predictions

How our 2026 World Cup prediction model works

Our 2026 FIFA World Cup forecasts come from a statistical prediction model that blends three approaches — an Elo rating system, a Dixon-Coles Poisson goals model, and a hierarchic…

How we make predictions

What we predict and how

For every prediction target — match outcomes, goal totals, scorelines, individual player events — there's a standard modelling approach and a set of input variables. This page cat…

Behind the scenes

Where our data comes from

The quality of any prediction depends on the data behind it. This page maps every data source we use — from free public archives to commercial feeds — and explains what each one p…

전체 22편 문서 →

우리가 시도한 것

연구 노트

모델 구축 과정의 의사결정 로그: 가설, 백테스트, 결과, 채택/미채택 판정. 실패한 실험은 성공과 함께 기록됩니다.

Shipped · 29 June 2026

Neural Poisson: a nonlinear extension of Dixon-Coles

The ensemble's three existing models share a structural constraint:

Not shipped · 3 June 2026

A within-match chase layer "passes" the headline gate — and the placebo proves it shouldn't

The feasibility probe found that, after controlling for team strength, only

Shipped · 31 May 2026

Testing our approach on the Champions League final

The `/test/live/<slug>/` route renders the live-tracker pipeline

전체 25편 연구 노트 →

최신 게시글

최근 변경 사항

가장 최근 모델 실행과 발견 사항에 대한 짧은 노트.

1 July 2026 · edwin-chan