Factlen ExplainerForecasting ScienceExplainerJun 17, 2026, 11:20 AM· 7 min read· #2 of 2 in meta

The Science of Superforecasting: How Prediction is Becoming a Trainable Skill

Once considered a rare innate talent, forecasting global events and scientific breakthroughs is increasingly recognized as a trainable cognitive skill. By combining structured probabilistic reasoning with prediction markets and AI, ordinary people are consistently outperforming subject-matter experts.

By Factlen Editorial Team

Share this story

Cognitive Forecasters 40%Meta-Science Researchers 35%AI Integration Optimists 25%

Cognitive Forecasters: Argue that prediction is a trainable skill relying on structured probabilistic reasoning rather than raw subject-matter expertise.
Meta-Science Researchers: Value prediction markets as a vital, crowdsourced triage tool to solve the academic replication crisis.
AI Integration Optimists: Believe that combining human causal reasoning with AI's massive data synthesis will exponentially improve global forecasting.

What's not represented

· Traditional Subject-Matter Experts
· Skeptics of Crowdsourced Intelligence

Why this matters

Understanding how to accurately forecast probabilities allows individuals and organizations to make better decisions under uncertainty. By treating prediction as a trainable skill rather than a guessing game, society can better allocate resources, validate scientific research, and mitigate future risks.

Key points

Forecasting is increasingly recognized as a trainable cognitive skill rather than an innate talent or a byproduct of subject-matter expertise.
Superforecasters rely on structured probabilistic reasoning, heavily utilizing historical base rates and continuous calibration to eliminate cognitive bias.
Prediction markets are successfully being used to forecast the reproducibility of scientific studies with approximately 73 percent accuracy.
When academic prediction markets price a study's chance of replication below 50 percent, they correctly predict a failure over 90 percent of the time.
The future of forecasting is widely expected to be a hybrid model, combining AI's data synthesis capabilities with human causal reasoning.

73%

Accuracy of prediction markets forecasting scientific replication

>90%

Accuracy when markets predict a study will fail to replicate

50-60

Questions used to evaluate a forecaster's baseline accuracy

Humans are biologically hardwired to predict the future, constantly anticipating everything from the weather to the stock market. Yet, when it comes to forecasting complex global events, most people—including highly credentialed subject-matter experts—perform scarcely better than random chance. In recent years, however, a quiet revolution has transformed prediction from a mystical talent into a rigorous, teachable science. By treating forecasting as a measurable cognitive skill rather than an innate gift, researchers have discovered that ordinary individuals can be trained to consistently outperform intelligence analysts, economists, and professional pundits. This shift is democratizing how organizations and governments navigate uncertainty.[1][7]

The foundation of this movement traces back to a massive research tournament funded by the Intelligence Advanced Research Projects Activity (IARPA), the United States intelligence community's research and development wing. Led by University of Pennsylvania psychology professor Philip Tetlock, the project sought to discover if crowdsourced forecasting could beat traditional intelligence gathering. The results were staggering: a specific subset of laypeople, dubbed "superforecasters," predicted geopolitical events with astonishing accuracy, routinely beating analysts who had access to classified information. This proved that accurate forecasting relies less on secret data and more on how a person processes publicly available information.[1][2]

What exactly makes a superforecaster? According to Warren Hatch, CEO of Good Judgment—the commercial spinoff of Tetlock's original research—it is rarely deep subject-matter expertise. In fact, experts are often hampered by their own specialized knowledge, which can lead to overconfidence and tunnel vision. Instead, superforecasters rely on a specific cognitive toolkit centered around structured probabilistic reasoning. They do not think in terms of absolute certainties or vague terms like "likely" or "doubtful." They assign precise percentage probabilities to specific scenarios and constantly update those numbers as new evidence emerges.[1][2]

The core mechanism of this toolkit relies on separating the "outside view" from the "inside view." When faced with a complex question—such as whether a specific technology startup will go public by the end of 2026—a novice forecaster immediately looks at the company's specific financials, leadership team, and product roadmap. This is the inside view. A superforecaster, however, completely ignores the company at first. They start with the "base rate": historically, what percentage of similar startups in this sector go public within a similar timeframe?[2]

Anchoring predictions in historical base rates is a foundational skill in superforecasting.

By anchoring their initial prediction in the statistical base rate, the superforecaster establishes a grounded, objective starting point. Only then do they adjust their probability up or down based on the specific, unique details of the company in question. This disciplined approach prevents forecasters from being swayed by compelling narratives, charismatic leadership, or sensationalized recent news cycles. It forces the human brain to respect historical averages, which are statistically far more reliable than individual case studies. Over hundreds of predictions, this simple cognitive habit dramatically reduces error rates and filters out emotional bias.[2][7]

Another critical component of the superforecaster's methodology is "calibration"—the alignment between a forecaster's stated confidence and their actual accuracy. If a perfectly calibrated forecaster predicts one hundred different events with 70 percent confidence, exactly 70 of those events will occur. Most ordinary people are vastly overconfident, routinely assigning 90 percent certainty to events that only happen half the time. Training programs have demonstrated that even brief interventions, such as taking a 90-minute fundamental forecasting course, can significantly improve a person's calibration, teaching them to recognize the limits of their own knowledge.[2][6]

Beyond geopolitics and finance, these structured forecasting techniques are now being deployed to solve one of academia's most persistent and expensive problems: the scientific replication crisis. Across psychology, economics, and medicine, a disturbing number of foundational, highly cited studies have failed to hold up when independent researchers attempt to reproduce the original experiments. Conducting direct replications is incredibly expensive and time-consuming, meaning the scientific community needs a reliable way to triage which published papers to trust and which to view with skepticism.[3][4]

To address this bottleneck, researchers and organizations like the U.S. Defense Advanced Research Projects Agency (DARPA) have turned to prediction markets. In these specialized academic markets, scientists and trained forecasters use real or simulated currency to buy and sell "shares" based on whether they believe a specific published finding will successfully replicate in a rigorous secondary trial. The trading price of a share effectively becomes the crowd's consensus probability that the study is scientifically sound. If traders spot methodological flaws or suspiciously perfect p-values, they short the study, driving its probability score down.[3][7]

To address this bottleneck, researchers and organizations like the U.S.

The results of these academic prediction markets have been remarkably robust. Pooled data from multiple large-scale replication projects reveals that prediction markets can forecast replication outcomes with approximately 73 percent accuracy, consistently outperforming simple surveys of experts. They are particularly effective at identifying flawed research; when a prediction market prices a study's chance of replication below 50 percent, the market is correct in predicting a failure more than 90 percent of the time. This mechanism effectively crowdsources scientific skepticism, aggregating the collective intuition and statistical scrutiny of the research community.[3][4]

Prediction markets consistently outperform traditional surveys when forecasting whether a scientific study will replicate.

By providing a rapid, low-cost confidence score for new discoveries, prediction markets are saving funding agencies millions of dollars. Instead of investing heavily in unproven behavioral interventions or medical treatments based on a single flashy paper, policymakers can consult the market. If the superforecasters and scientific peers heavily doubt the finding, resources can be redirected toward more robust research. This application proves that forecasting is not just about predicting the future; it is a powerful tool for establishing the truth in the present.[4][7]

As superforecasting matures as a discipline, it is currently facing its next major evolution: the integration of artificial intelligence. With the rapid advancement of large language models throughout 2025 and 2026, AI systems are increasingly being tested as autonomous forecasters on public prediction platforms like Polymarket and Kalshi. Tech companies and academic researchers are racing to determine whether an AI, armed with the entirety of the internet's historical data and real-time news feeds, can out-predict a trained human expert. Early markets are already pricing in the probability of AI models achieving frontier forecasting benchmarks by the end of the decade.[5]

Early data shows that advanced models like Anthropic's Claude and OpenAI's latest architectures possess an unparalleled ability to synthesize vast amounts of news, financial reports, and historical data almost instantly. However, AI models still struggle with the nuanced, real-world causal modeling that human superforecasters excel at. While an AI can perfectly retrieve a historical base rate or summarize a complex geopolitical treaty, it often fails to weigh conflicting human evidence accurately. Furthermore, AI systems frequently stumble when modeling unprecedented "black swan" scenarios where historical training data is either irrelevant or actively misleading.[5][6]

Consequently, the consensus among elite forecasting organizations is that the immediate future does not involve AI replacing human judgment, but rather a powerful hybrid approach. AI tools are being seamlessly integrated into the forecaster's workflow to rapidly pull base rates, summarize foreign-language news broadcasts, and automatically check the human's logic for known cognitive biases. The human superforecaster then takes this synthesized data and provides the final causal reasoning, weighing the abstract human elements—like political ego or cultural friction—that algorithms consistently miss. This symbiotic relationship amplifies the strengths of both while mitigating their respective weaknesses.[6]

The future of forecasting relies on a hybrid model that combines AI's data synthesis with human causal reasoning.

This hybrid model is already yielding tangible results in corporate strategy, supply chain risk management, and public policy planning. By examining the detailed chain of reasoning provided by human-AI forecasting teams—something a purely black-box AI system cannot reliably provide on its own—corporate and government leaders can rigorously stress-test their assumptions. They can trace the causal links of a specific prediction, identify hidden risks in their operations, and make high-stakes decisions with a mathematically grounded sense of confidence. The value lies not just in the final percentage, but in the transparent logic used to reach it.[6][7]

Ultimately, the democratization of forecasting represents a profound shift in how society handles uncertainty. By moving away from loud punditry, emotional gut feelings, and rigid subject-matter expertise, and moving toward calibrated, probabilistic thinking, organizations and individuals are becoming vastly better equipped to navigate an increasingly complex world. Whether it is validating a breakthrough cancer treatment, anticipating a supply chain disruption, or simply making better personal financial choices, the tools of the superforecaster are universally applicable. Predicting the future will never be flawless, but by treating it as a rigorous science, humanity can ensure that its collective foresight is the sharpest it has ever been.[7]

How we got here

2011
The U.S. intelligence community launches a massive forecasting tournament to find better ways to predict global events.
2015
Philip Tetlock publishes 'Superforecasting,' detailing how ordinary citizens outperformed intelligence analysts in the tournament.
2019
DARPA launches the SCORE program to test if prediction markets can accurately assess the credibility of social science research.
2021
Pooled data from multiple replication projects confirms prediction markets forecast scientific reproducibility with 73% accuracy.
2025
Forecasting organizations begin formally integrating large language models into a hybrid human-AI prediction workflow.

Viewpoints in depth

The Cognitive Forecaster's View

Focuses on the mechanics of human reasoning and the elimination of bias.

This camp emphasizes that humans are naturally poor at predicting the future because of cognitive biases, particularly overconfidence and the tendency to ignore historical base rates. They argue that by training individuals to think in precise probabilities and constantly update their beliefs based on new evidence, society can drastically improve decision-making. For them, the process of calibration—learning to align one's confidence with actual accuracy—is the most critical skill a modern knowledge worker can develop.

The Meta-Science View

Focuses on using crowdsourced predictions to clean up academic research.

Researchers in this camp are deeply concerned with the replication crisis, where foundational studies in psychology and medicine fail to hold up under scrutiny. They view prediction markets not as a financial tool, but as an epistemic one. By forcing scientists to put 'skin in the game' and bet on whether a study will replicate, these markets aggregate the community's unspoken skepticism. This provides funding agencies with a highly accurate, low-cost metric to determine which research deserves further investment.

The AI Integration View

Focuses on the synergy between large language models and human judgment.

This perspective acknowledges that while human superforecasters are currently the gold standard, the sheer volume of global data is becoming impossible for humans to process alone. They advocate for a hybrid workflow where AI models instantly retrieve historical base rates, summarize foreign-language news, and flag potential cognitive biases in human reasoning. The human's role then shifts from data gathering to pure causal modeling and final judgment, creating a forecasting engine far more powerful than either humans or AI alone.

What we don't know

Whether large language models will eventually develop the causal reasoning capabilities necessary to out-predict human superforecasters without assistance.
How prediction markets can be scaled to evaluate highly niche or highly classified research where the pool of qualified forecasters is too small to form a liquid market.
The exact long-term impact of widespread forecasting training on general public decision-making and financial literacy.

Key terms

Base Rate: The historical average or statistical probability of an event occurring within a specific category, used as a starting point for predictions.
Calibration: The degree to which a forecaster's confidence matches their actual accuracy; a perfectly calibrated forecaster is right 70% of the time when they are 70% confident.
Prediction Market: An exchange where individuals trade shares based on the outcomes of future events, using financial incentives to aggregate collective knowledge.
Brier Score: A mathematical metric used to measure the accuracy of probabilistic predictions, heavily penalizing extreme overconfidence.

Frequently asked

Can anyone become a superforecaster?

Yes. Research shows that forecasting is a trainable skill. While some baseline numeracy helps, techniques like probabilistic reasoning and calibration can be learned by anyone.

Why are subject-matter experts often poor forecasters?

Experts often fall prey to the 'inside view,' overweighing specific details of their field while ignoring historical base rates. They also tend to be overconfident in their highly specialized knowledge.

How do prediction markets help science?

They allow the scientific community to place bets on whether a published study will successfully replicate, providing a highly accurate, low-cost confidence score before expensive replication trials are run.

Will AI replace human forecasters?

Current consensus suggests a hybrid future. While AI excels at synthesizing vast amounts of data and retrieving base rates, human forecasters are still required for complex causal reasoning and stress-testing assumptions.

Sources

[1]The Washington PostCognitive Forecasters
Good Judgment CEO Warren Hatch discusses what makes a 'super forecaster,' prediction markets and the role of AI in forecasting
Read on The Washington Post →
[2]Good Judgment IncCognitive Forecasters
Superforecasting Fundamentals and Training
Read on Good Judgment Inc →
[3]Royal Society Open ScienceMeta-Science Researchers
Predictions of replicability: Can researchers predict if classic findings replicate?
Read on Royal Society Open Science →
[4]National Institutes of HealthMeta-Science Researchers
Predicting replicability—Analysis of survey and prediction market data from large-scale forecasting projects
Read on National Institutes of Health →
[5]MLQ.aiAI Integration Optimists
AI Prediction Markets Brief: February 2026
Read on MLQ.ai →
[6]Good Judgment 2026 ReviewAI Integration Optimists
Good Judgment's 2025 in Review: AI and Superforecasting
Read on Good Judgment 2026 Review →
[7]Factlen Editorial Team
Synthesis by Factlen editorial team
Read on Factlen Editorial Team →

Up next

Cognitive Science

The Science of Steelmanning: How Constructive Disagreement Upgrades Our Thinking

Psychologists and philosophers are championing 'steelmanning'—the practice of strengthening an opponent's argument before critiquing it—as a powerful antidote to modern polarization.

Stay informed

Every angle. Every day.

Get meta stories with full source coverage and perspective breakdowns delivered to your inbox.

Get the briefing →Browse meta