A Bayesian aggregation of expert forecasts, weighted by historical prediction accuracy, on both the strategic outcome and near-term trajectory of the 2026 US–Israel–Iran war.
Four weeks into Operation Epic Fury, the war with Iran has no clear end in sight. President Trump oscillates between threatening to "obliterate" Iran's power plants and floating a wind-down. Iran's regime, now under Mojtaba Khamenei, has survived decapitation, kept the Strait of Hormuz effectively closed, and signalled it intends to fight a war of attrition. Oil prices have risen 45% since the conflict began.
This report attempts to answer two questions every observer is asking: where does this war end? and what happens along the way? To answer both, we tracked down the recent predictive statements of fourteen geopolitical experts whose forecasting records we have been scoring for accuracy over years. Rather than simply averaging their views, we built a Bayesian model that weights each expert by their demonstrated excess skill above chance — and discards vague or inapplicable statements entirely.
The result has two parts. First, a probability distribution over five strategic end-states — from decisive Iranian victory to decisive US/Israel victory. Second, probability estimates for four binary questions about the war's near-term trajectory: duration, ground troops, the Strait of Hormuz, and infrastructure strikes. All seven forecasts use the same model, the same expert panel, and the same parameters.
Model parameters: τ = 0.20, κ = 5.0.
The probability mass tilts toward Iranian-advantaged outcomes. This does not mean Iran "wins" in a conventional military sense — the country is being devastated. But the experts with the strongest track records converge on a shared assessment: the US and Israel lack a coherent strategy for translating tactical success into a durable political outcome. The war is unlikely to end quickly, the Strait of Hormuz is unlikely to be reopened by force alone, and escalation to civilian infrastructure strikes is roughly a coin toss.
Part I — How does the war end?
We defined five mutually exclusive strategic end-states, each specified by five observable criteria: regime survival, Hormuz status, nuclear program, proxy networks, and international perception. Five bins rather than seven — because the experts' statements rarely distinguish finer gradations, and matching output resolution to input resolution produces more honest probabilities.
The five outcomes, defined
The Islamic Republic survives fully intact with its command structure and institutions preserved. The Strait of Hormuz is reopened only on Tehran's terms — through a non-aggression pact, sanctions relief, or security guarantees that amount to US/Israeli capitulation. Iran's nuclear enrichment program resumes or accelerates, potentially crossing the weapons threshold. Proxy networks (Hezbollah, Houthis, Iraqi militias) are reconstituted. The US is forced into an unconditional or near-unconditional withdrawal, widely perceived internationally as a strategic humiliation comparable to Saigon 1975 or Kabul 2021.
The regime survives but is meaningfully weakened — leadership depleted, economy shattered, military infrastructure degraded. However, a negotiated ceasefire still tilts in Iran's favour: Iran extracts meaningful concessions (partial sanctions relief, recognition of enrichment rights, a non-aggression commitment, or reparations) in exchange for reopening Hormuz. Some nuclear capacity is retained — enough latent knowledge, hidden material, and dispersed centrifuges that Iran remains a threshold state. The US achieves some tactical objectives (damage to missile stocks, leadership kills) but falls clearly short of its strategic goals. The war is broadly interpreted as an overreach by Washington.
Both sides suffer major damage without either achieving decisive strategic objectives. The Iranian regime is badly wounded — economy in ruins, infrastructure destroyed, leadership decimated — but remains in power with its core institutions functional. The US and Israel have degraded Iran's military and nuclear capabilities significantly but have not achieved regime change or permanent nuclear dismantlement. The Strait of Hormuz reopens through mutual de-escalation or international mediation. Both sides publicly claim victory. The fundamental disputes remain unresolved, with a high probability of renewed conflict within 3–5 years.
The US and Israel achieve most of their military objectives: Iran's nuclear program is set back by 5–10 years, missile and drone arsenals severely depleted, proxy networks significantly disrupted. Hormuz is reopened through military pressure or a favourable ceasefire. However, the Iranian regime survives — battered and isolated, but not overthrown. A successor leadership remains hostile and begins rebuilding. No permanent resolution; the conflict enters a frozen phase with periodic flare-ups likely.
Iran's nuclear weapons program is effectively destroyed — enrichment facilities, centrifuge manufacturing, and hidden HEU stockpiles all neutralised. The regime is fractured or collapses entirely: IRGC command structure broken, supreme leadership in disarray or replaced by a cooperative government. Hormuz fully reopened. Proxy networks dismantled. In the strongest version, a successor government accepts permanent end to enrichment, dismantlement of missile programs, and normalisation with Israel — analogous to post-WWII Japan or Germany. This outcome almost certainly requires either a ground invasion, a popular revolution installing a cooperative government, or total internal collapse — all historically rare without prolonged occupation.
What the experts are saying
Ranked by effective weight — a product of historical accuracy, statement applicability, and the dependence discount.
| Expert | Role | Signal | Accuracy | n | Peak outcome |
|---|---|---|---|---|---|
| Jiang Xueqin | Educator & YouTuber, 'Predictive History' | 64% | 88% | 13 | Iranian Victory |
| Danny Citrinowicz | Fmr. Head Iran Branch, IDF Intelligence | 56% | 84% | 11 | Iranian Advantage |
| David Albright | Physicist, Fmr. IAEA Inspector, ISIS President | 50% | 82% | 7 | US/Israel Advantage |
| Vali Nasr | Prof. International Affairs, Johns Hopkins SAIS | 45% | 79% | 6 | Iranian Victory |
| Hossein Bastani | BBC Persian Journalist & Analyst | 50% | 79% | 14 | Stalemate |
| John Mearsheimer | IR Theorist, University of Chicago | 36% | 71% | 6 | Iranian Victory |
| Jennifer Kavanagh | Defense Priorities | 41% | 75% | 6 | Stalemate |
| Karim Sadjadpour | Senior Fellow, Carnegie Endowment | 23% | 58% | 6 | Stalemate |
| Abdollah Shahbazi | Iranian Historian & Analyst | 19% | 57% | 11 | Iranian Victory |
| Scott Ritter | Fmr. UN Weapons Inspector | 10% | 45% | 5 | Iranian Victory |
Why the model tilts toward Iran
The structural case for an Iranian-advantaged outcome rests on an asymmetry that multiple high-signal experts identify independently. Mearsheimer (signal 42%) frames it as the Vietnam analogy: for Iran to win, all it has to do is survive. The US, by contrast, needs to achieve regime change and install a compliant successor — a far higher bar that has almost never been met by airpower alone. Iran's regime survived the killing of its supreme leader, appointed a successor within days, and has continued to launch missiles four weeks into the war. The decapitation strategy, which was the original theory of victory, has failed on its own terms.
Vali Nasr (signal 50%) adds the temporal dimension: Iran has studied this conflict for decades, drawing on the Iran-Iraq War's lesson that endurance beats firepower. The regime believes, as Nasr puts it, that the US and Israel "can dash fast, but they're not long-distance runners." The Hormuz closure operationalises this insight — it converts Iran's geographic position into economic leverage that compounds over time. Every week the strait stays closed, the pressure shifts from Tehran to Washington.
Jiang Xueqin (signal 77%, the panel's highest) provides the game-theoretic scaffolding: the US military-industrial complex is optimised for Cold War-style power projection, not 21st-century asymmetric attrition. The cost asymmetry — million-dollar interceptors against $50,000 drones — is structurally unsustainable. Israel's interceptor stocks are depleting. And Iran's proxy network, while degraded, means the conflict is simultaneously active across Lebanon, Iraq, the Gulf states, and the Red Sea, stretching coalition resources across a front no air campaign can cover.
Operational achievements are not accumulating into strategic effect. Danny Citrinowicz — single most consequential statement in the dataset
Citrinowicz (signal 68%), critically, reaches a similar conclusion from inside the Israeli security establishment. His assessment that "operational achievements are not accumulating into strategic effect" is the single most consequential statement in the dataset — because it comes from someone with both high historical accuracy and deep domain expertise, and it directly contradicts the narrative being advanced by both the US and Israeli governments.
The case the model could be wrong
The strongest counter-case comes from David Albright (signal 57%), who focuses on what the war has achieved rather than what it hasn't. Iran has lost approximately 20,000 centrifuges, its enrichment infrastructure at Natanz and Fordow is severely damaged, and its centrifuge manufacturing capability has been destroyed. This creates a genuine bottleneck: even if the regime survives, reconstituting nuclear capacity will take years, not months. Albright's assessment — nuclear program "set back significantly but not destroyed" — is more favourable to the US than the headline number suggests, because it implies the war's primary stated objective (preventing a nuclear Iran) may actually be partially achieved even in a stalemate.
Sadjadpour (signal 17%) offers a different structural argument: the Iranian regime is deeply unpopular, inheriting a wrecked economy with no answers for the enormous popular discontent that preceded the war. The protests of January 2026, which brought 5 million Iranians to the streets, were the largest since 1979. Even if the regime survives the war, it may not survive the peace — a battered, impoverished theocracy facing a hostile population and a destroyed economy may be a dead regime walking. In this reading, the US achieves its objectives not through the war itself but through the conditions the war creates.
There is also a scenario the panel may systematically underweight: that Trump, unconstrained by institutional norms and genuinely willing to escalate beyond any previous US president, does something none of the experts expect — whether that's a ground invasion, an attack on Iranian power infrastructure, or a deal so unorthodox it falls outside the analytical frameworks these experts use. Several experts (notably Mearsheimer and Nasr) build their forecasts on the assumption that rational strategic logic constrains US behaviour. Trump's track record suggests that assumption may be the weakest link in their reasoning.
Part II — What happens next?
Knowing how a war ends is not the same as knowing how it gets there. This section forecasts four binary questions about the war's near-term trajectory — each independently falsifiable, each with a clear 2-month resolution horizon. The same Bayesian methodology applies: extract each expert's excess skill above chance, filter for applicability, aggregate with a dependence discount. The expert panel overlaps with Part I but is not identical — for each question, we include only experts who have made relevant statements.
These four questions were chosen because they carve the space of possible trajectories into the smallest number of independent binary events that, taken together, tell you the shape of what's coming.
| Question | Prior | Posterior | Shift | Experts |
|---|---|---|---|---|
| Does the war end quickly? (Active hostilities end within 2 months, before late May 2026) |
30% | 24% Yes | −6.0pp | 6 |
| Major ground invasion? (US deploys 1,000+ troops in sustained combat on Iranian soil) |
30% | 34% Yes | +4.3pp | 6 |
| Hormuz reopened by force? (US military reopens strait to ≥50% pre-war shipping volume within 2 months) |
25% | 22% Yes | −3.4pp | 6 |
| Major infrastructure strikes? (US/Israel carry out sustained strikes on Iranian civilian infrastructure within 2 months) |
45% | 47% Yes | +2.2pp | 6 |
Three of the four trajectory forecasts point in the same direction the strategic distribution does: the war is more likely to grind than to resolve. Quick termination is a minority outcome. A US ground invasion is roughly a one-in-three event — higher than most commentary assumes, but still not the modal path. Reopening Hormuz by force is the single least likely of the four: the experts think the strait reopens through bargaining, attrition, or mutual exhaustion rather than through a successful combined-arms campaign. Only one forecast is a coin flip — sustained strikes on Iranian civilian infrastructure — and it nudged upward from the prior rather than down.
Methodology
The core problem in expert aggregation is distinguishing signal from noise. A naïve average gives equal weight to a random guesser and a calibrated forecaster. Worse, it treats a vague or irrelevant statement as a weak 50/50 forecast rather than what it actually is: no evidence at all. This model corrects both errors through a four-step process.
Step 1 — estimate historical reliability
ri = (κ · μ₀ + Si) / (κ + Ni)
Each expert's track record is summarised as a reliability score ri, where Si is the sum of historical accuracy scores (normalised 0–1) and Ni is the number of resolved predictions. The parameter κ controls shrinkage: it pulls experts with few predictions toward the population mean μ₀ = 0.65. A long, strong track record is trusted at face value; a newcomer with three predictions is treated with scepticism.
Step 2 — extract signal above chance
si = max(0, (ri − 0.50) / 0.50)
This is the key correction. A 50%-accurate expert on binary questions is no better than a coin flip — they carry zero useful signal, not half. The formula maps reliability onto a 0–1 signal scale where the null baseline maps to zero. Only demonstrated skill above chance earns influence. A reliability of 0.65 yields signal 0.30; 0.80 yields 0.60; only a perfect record reaches 1.00.
Step 3 — filter for applicability
ai ∈ [0, 1]
Each expert's current statements are scored for how directly they address the question. A vague or off-topic statement receives applicability near zero, contributing no evidence at all — not a weak push toward the middle. "I learned nothing from this" is silence, not a 50/50 forecast. Only clear, specific, falsifiable claims receive high applicability.
Step 4 — Bayesian aggregation
For the five-outcome strategic question, the aggregation is log-linear:
p(y) ∝ π₀(y) · ∏i [mi(y) / π₀(y)]τ · ai · si
For the four binary trajectory questions, the same logic operates in log-odds space:
log-odds(p) = log-odds(p₀) + Σi τ · ai · si · [log-odds(pi) − log-odds(p₀)]
In both cases, the exponent τ · ai · si controls influence. When any factor is zero — no skill, inapplicable statement, or forecast matching the prior — the expert contributes nothing. The parameter τ discounts for dependence: since many analysts share frameworks and information sources, their opinions are not fully independent. A τ of 0.2 treats the panel as carrying roughly the information of two fully independent sources.
Why five strategic outcomes, not seven?
Our original model used seven. We compressed to five because experts' statements rarely distinguish adjacent bins — Mearsheimer doesn't differentiate "Decisive Iranian Victory" from "Strong Iranian Advantage," and Albright doesn't distinguish "Strong US Advantage" from "Decisive US Victory." Forcing that granularity manufactures false precision. Five bins preserve the distinctions experts actually make while giving each category enough probability mass to be meaningful.
Note: the model uses max(0, …) for signal extraction, flooring anti-predictive experts at zero rather than inverting their signal. This is safer when you cannot confidently distinguish systematic wrongness from mere noise.
Data source
Historical accuracy scores come from the Expert Prediction Tracker V24 — a dataset of 501 predictions across 64 geopolitical experts, each dual-cited with a prediction source and an independent outcome source. The tracker uses Bayesian scoring on a five-tier scale (True, Mostly True, Partially True, Mostly False, False) with shrinkage for small samples. Expert statements sourced from public interviews, articles, and social media posts from February–March 2026.
Expert forecasts inferred from public statements, not self-reported. Applicability scores are editorial judgements. This is an analytical exercise, not policy or investment advice.