TakeOverBench
AI is rapidly becoming more dangerous. We track progress toward plausible AI takeover scenarios.
Dangerous capabilities timeline
According to "Model evaluation for extreme risks" (Shevlane et al., 2023), the most dangerous AI capabilities include Cyber-offense, Persuasion & manipulation, Political strategy, Weapons acquisition, Long-horizon planning, AI development, Situational awareness, and Self-proliferation. Progress on these capabilities is shown below.
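The timeline chart itself lives on the site; as a rough illustration of the underlying idea, the sketch below shows one way benchmark results could be grouped into per-capability time series. The scores, dates, and the `timeline` helper are hypothetical placeholders, not TakeOverBench's actual data or code.

```python
# Minimal sketch of how a dangerous-capabilities timeline could be assembled.
# All benchmark scores and dates below are hypothetical placeholders.
from collections import defaultdict
from datetime import date

# Hypothetical results: (capability, model release date, normalized score in [0, 1]).
RESULTS = [
    ("Cyber-offense", date(2023, 3, 1), 0.21),
    ("Cyber-offense", date(2024, 6, 1), 0.38),
    ("Long-horizon planning", date(2023, 3, 1), 0.15),
    ("Long-horizon planning", date(2024, 6, 1), 0.33),
    ("Self-proliferation", date(2024, 6, 1), 0.12),
]

def timeline(results):
    """Group scores by capability and sort each series chronologically."""
    series = defaultdict(list)
    for capability, when, score in results:
        series[capability].append((when, score))
    return {cap: sorted(points) for cap, points in series.items()}

if __name__ == "__main__":
    for capability, points in timeline(RESULTS).items():
        trend = " -> ".join(f"{d.isoformat()}: {s:.2f}" for d, s in points)
        print(f"{capability}: {trend}")
```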
How do dangerous capabilities lead to a takeover?
Based on the literature, these are four plausible AI takeover scenarios.
AI takes over using weapons of mass destruction
A scenario where advanced AI gains access to weapons or control systems enabling mass harm.
- Situational awareness
- Long-horizon planning
- Cyber-offense
- Self-proliferation
- and 3 more…
- Anthropic (2025)
- Farwell et al. (2011)
- Ngo et al. (2022)
AI takes over using persuasion and manipulation
A scenario where AI influences people or institutions at scale through persuasive strategies.
- Situational awareness
- Long-horizon planning
- Cyber-offense
- Self-proliferation
- and 2 more…
- Anthropic (2025)
- AP (2017)
- Ienca (2023)
- Ngo et al. (2022)
AI takes over by self-improving
A scenario where AI improves itself rapidly and gains capabilities beyond human control.
- AI development
- Bostrom (2014)
- Good (1966)
- Phuong et al. (2024)
- Yudkowsky (2008)
AIs in powerful positions take over by colluding
A scenario where multiple AI systems coordinate to achieve control or influence.
- Long-horizon planning
- Situational awareness
- Political strategy
- Hammond et al. (2025)
- Karnofsky (2022)
- McKinsey (2025)
- Motwani et al. (2025)
We aim to use the best available benchmarks and the most plausible AI takeover scenarios, but the field of AI safety is developing rapidly. Some benchmark scores may therefore imperfectly reflect the dangerous capability they are meant to measure, and experts may disagree about the likelihood of specific threat models.