How Superforecasters and Prediction Markets Are Changing the Science of Decision-Making
By combining disciplined probability tracking with financial incentives, a new era of forecasting is replacing vague punditry with measurable accuracy.
By Factlen Editorial Team
- Prediction Market Advocates
- Believe financial stakes and market liquidity are the best mechanisms for surfacing the truth.
- Forecasting Scientists
- Value structured aggregation, Brier scores, and rigorous testing of predictions over time.
- AI Optimists
- Argue that large language models will soon surpass human collective intelligence in predictive accuracy.
- Traditional Analysts
- Emphasize the importance of domain expertise and qualitative context that pure statistics might miss.
What's not represented
- · Behavioral Psychologists
- · Regulatory Authorities
Why this matters
Understanding how to think in probabilities rather than absolutes empowers individuals to make better financial, career, and life decisions in an increasingly uncertain world.
Key points
- Traditional expert predictions are often vague and rarely checked for accuracy.
- Superforecasters use disciplined, probabilistic thinking to consistently beat intelligence analysts.
- Prediction markets use financial incentives to filter out bad predictions and surface the truth.
- AI models are highly competitive in forecasting, but humans still lead on complex, novel events.
Every day, we are bombarded with predictions about the future. Will inflation drop? Will a new technology succeed? Will a geopolitical conflict escalate? Traditional media and experts often leave the public more confused than informed, hedging their bets with vague words like "might" or "could." Because these predictions lack quantifiable probabilities, they are nearly impossible to verify, allowing pundits to claim victory regardless of the actual outcome.[1][6]
For decades, the accuracy of expert predictions was rarely checked or held to any rigorous standard. In his landmark research, psychologist Philip E. Tetlock famously demonstrated that the average expert forecaster was roughly as accurate as a "dart-throwing chimpanzee." When experts speculated on major global events, their deep domain knowledge often made them overconfident rather than accurate, blinding them to alternative outcomes and unexpected variables.[1][2]
But forecasting is not a lost cause; it is a measurable, improvable skill. The paradigm shift began in 2011 when the U.S. Intelligence Advanced Research Projects Activity (IARPA) launched a massive tournament to improve the intelligence community's ability to foresee global events. The goal was to see if structured, crowdsourced forecasting could outperform traditional, siloed intelligence gathering.[2][5]
Tetlock entered the tournament with a team of ordinary citizens, dubbed the Good Judgment Project. Over four years and hundreds of questions, this group of volunteers consistently outperformed trained intelligence analysts who had access to classified information. They also beat the accuracy of random guessing and early prediction markets, proving that a specific methodology could yield remarkable foresight.[2]

These elite predictors became known as "superforecasters." Their success was not due to insider knowledge, genius-level IQs, or psychic intuition, but rather a specific, disciplined approach to thinking. They broke large, complex questions into smaller, manageable parts, actively sought out opposing viewpoints, and constantly updated their probabilities as new information arrived.[2][6]
To truly improve forecasting, one must be able to measure it objectively. The gold standard for this measurement is the Brier score, a mathematical formula that evaluates the accuracy of probabilistic predictions. Developed in 1950, the Brier score calculates the squared deviation between the probabilities assigned to forecasts and the actual binary outcomes.[5]

A Brier score ranges from 0 to 1. A perfect score is 0, indicating absolute accuracy, while a forecaster who simply guesses 50/50 on every question receives a baseline score of 0.25. Crucially, the Brier score heavily penalizes overconfidence—being 100% certain about an event that ultimately does not happen destroys a forecaster's rating, structurally encouraging intellectual humility.[5]
A perfect score is 0, indicating absolute accuracy, while a forecaster who simply guesses 50/50 on every question receives a baseline score of 0.25.
While tournaments proved that humans could be trained to predict better, a parallel movement sought to crowdsource accuracy using financial incentives: prediction markets. Platforms like Polymarket and Kalshi have turned probability into a tradable asset, allowing users to buy and sell shares based on the likelihood of future events.[1][4]
In a prediction market, if a share for a "Yes" outcome trades at 60 cents, the market implies a 60% probability of that event occurring. Because traders have their own money on the line, the market naturally filters out low-quality, performative punditry. People who are consistently wrong lose money, while accurate forecasters profit, creating a powerful financial incentive for truth-seeking.[1]
By 2026, these platforms are processing billions of dollars in volume each month. They track everything from the success of new cancer drugs to macroeconomic stability, with risk-monitoring markets alone accounting for roughly $3.8 billion in trading volume. The financial stakes force participants to calculate thoughtfully and weigh evidence, creating a real-time, living assessment of global probabilities.[1][4]

The combination of superforecasting techniques and prediction markets has created a powerful engine for collective intelligence. But a new challenger has recently entered the arena: artificial intelligence. Large Language Models (LLMs) are now being deployed as a new form of collective intelligence, aggregating human knowledge at an unprecedented scale.[3]
Recent studies have tested these AI models against human aggregates in forecasting tournaments like ForecastBench. AI models excel at processing vast amounts of historical data and identifying patterns, making them highly competitive on questions with established datasets and short feedback loops.[3]
However, human superforecasters still hold a commanding lead on complex, novel market questions where historical data is sparse and human intuition is required to weigh unprecedented variables. The current frontier of forecasting relies on the synergy between human reasoning and AI processing power, rather than one replacing the other.[3][6]

For the average person, the lessons of superforecasting offer a concrete toolkit for better decision-making in everyday life. Research shows that simply learning the basic principles of forecasting—such as embracing doubt, avoiding absolute certainty, and updating beliefs incrementally—can improve an individual's predictive accuracy by 10%.[2]
Ultimately, the science of forecasting teaches us that "knowing what we don't know is better than thinking we know what we don't." By demanding verifiable accuracy, utilizing the wisdom of crowds, and thinking in probabilities rather than absolutes, society can navigate an uncertain world with greater clarity, confidence, and capability.[1][2]
How we got here
2004
James Surowiecki publishes 'The Wisdom of Crowds', popularizing the idea that groups can out-predict experts.
2011
The U.S. government launches the IARPA forecasting tournament to improve intelligence gathering.
2015
Philip Tetlock publishes 'Superforecasting', detailing how ordinary citizens beat trained analysts.
2026
Large Language Models begin competing directly against human aggregates in forecasting tournaments.
Viewpoints in depth
Forecasting Scientists
Value structured aggregation, Brier scores, and rigorous testing of predictions over time.
Researchers in decision science argue that society relies too heavily on charismatic pundits who speak in absolutes. They advocate for a world where all public forecasts are tracked, scored, and ranked using objective metrics like the Brier score. By demanding verifiable accuracy, they believe we can structurally improve how governments and corporations make critical decisions.
Prediction Market Advocates
Believe financial stakes and market liquidity are the best mechanisms for surfacing the truth.
Proponents of platforms like Polymarket and Kalshi argue that talk is cheap. They believe that the only way to get a truly honest assessment of probability is to force people to put their own money on the line. In their view, the financial pain of losing money naturally weeds out bad actors and performative outrage, leaving behind a highly efficient, real-time consensus of reality.
AI Optimists
Argue that large language models will soon surpass human collective intelligence in predictive accuracy.
Technologists point out that AI models can ingest and analyze historical data at a scale impossible for human teams. While they acknowledge that humans currently hold an edge on novel, unprecedented events, they argue that as AI models improve their reasoning capabilities, they will inevitably become the ultimate superforecasters, capable of modeling complex global systems in real-time.
Traditional Analysts
Emphasize the importance of domain expertise and qualitative context that pure statistics might miss.
While acknowledging the power of crowds and algorithms, traditional intelligence and financial analysts caution against relying entirely on statistical aggregates. They argue that deep domain expertise is still required to understand the qualitative nuances, cultural contexts, and "black swan" variables that a purely quantitative model might overlook.
What we don't know
- Whether prediction markets can maintain their accuracy if they become heavily regulated or restricted.
- How quickly AI models will close the gap with human superforecasters on complex, unprecedented events.
- Whether the general public will ever widely adopt probabilistic thinking over traditional, absolute punditry.
Key terms
- Superforecaster
- An individual who consistently achieves exceptional prediction accuracy by using disciplined, evidence-based reasoning rather than raw intuition.
- Brier Score
- A metric used to measure the accuracy of probabilistic predictions, ranging from 0 to 1, which penalizes overconfidence.
- Prediction Market
- An exchange where individuals trade contracts that pay out based on the outcome of future events, using financial stakes to crowdsource probabilities.
- Wisdom of the Crowd
- The theory that the collective opinion of a diverse group of individuals is often more accurate than that of a single expert.
- Base Rate
- The historical frequency of an event occurring, which superforecasters use as a starting point before adjusting for new information.
Frequently asked
Can anyone become a superforecaster?
Yes. Research shows that forecasting is a skill that can be cultivated. Simply learning the basic principles of probability and avoiding overconfidence can improve accuracy by 10%.
How do prediction markets prevent manipulation?
While large trades can temporarily shift odds, markets generally self-correct. If a "whale" artificially inflates a price, it creates a profitable opportunity for other traders to bet against them, restoring the true probability.
What is a Brier score?
A Brier score is a mathematical formula used to measure the accuracy of a forecast. It ranges from 0 (perfect accuracy) to 1 (perfectly wrong), and heavily penalizes forecasters who are 100% confident but end up being incorrect.
Will AI replace human forecasters?
Currently, AI excels at processing historical data, but human superforecasters still outperform AI on complex, novel questions. The most accurate forecasts today come from combining human intuition with AI processing.
Sources
[1]Prediction NewsPrediction Market Advocates
Accuracy at Stake: How Prediction Markets Are Changing the Forecasting Game
Read on Prediction News →[2]BookBrowseForecasting Scientists
Superforecasting: The Art and Science of Prediction
Read on BookBrowse →[3]Royal Society PublishingAI Optimists
Crowdsourced forecasting and large language models
Read on Royal Society Publishing →[4]Asterisk MagazinePrediction Market Advocates
Prediction Markets in 2026
Read on Asterisk Magazine →[5]OSF PreprintsForecasting Scientists
Tracking Forecasting Accuracy in Intelligence Organizations
Read on OSF Preprints →[6]Factlen Editorial TeamTraditional Analysts
Synthesis by Factlen editorial team
Read on Factlen Editorial Team →
Every angle. Every day.
Get opinion stories with full source coverage and perspective breakdowns delivered to your inbox.






