Vivold Consulting

Anthropic asks Washington for the power to block dangerous AI - plus a plan to cushion workers

Key Insights

Anthropic published two policy proposals arguing that AI is outpacing a policymaking process built for a slower era. Its Advanced AI Framework would give government legal authority to block or deter dangerous model deployments, backed by transparency, independent evaluation, and revenue-scaled penalties - aimed only at the largest frontier developers (models above 10^25 FLOPs). A companion Economic Policy Framework focuses on preparing workers and sharing AI's financial gains.

Stay Updated

Get the latest insights delivered to your inbox

Anthropic to government: regulate us - carefully, but for real

Anthropic's core argument is blunt: AI capability is on a steep curve, and the institutions meant to govern it were designed for a slower world. So it's proposing two frameworks - one to steer the technology's risks, one to prepare society for its economic shock.

The case for a government "off switch"

The Advanced AI Framework makes a striking ask for an AI company: governments should have explicit legal authority to block or deter the deployment of models that pose catastrophic risk, going beyond current law. Anthropic points to its own Mythos Preview model, which it says found thousands of high-severity vulnerabilities across major operating systems and browsers, as evidence the stakes are climbing fast.

But it pairs that ask with guardrails against overreach. The rules would apply narrowly - only to models trained on more than 10^25 FLOPs, by firms earning over $500M in AI revenue or spending $1B+ on R&D - and penalties would be tied to global annual revenue, escalating for repeat violations.

Four risks it wants addressed

- On biology, the same capabilities that accelerate drug discovery could also lower the bar for designing dangerous pathogens.
- On cyber, frontier models can now find critical vulnerabilities at scale - a gift to defenders, but a threat to hospitals and the power grid.
- There's the loss-of-control danger of systems acting outside their developers' intent.
- And automated R&D, where AI improving AI could amplify all of the above.

What it would require of developers

The framework leans heavily on transparency and outside checks:

- Companies would test their models and publish summaries, safety frameworks, and system cards - plus regular risk reports going beyond what California and New York already mandate.
- They'd engage independent evaluators to review those claims.
- They'd secure model weights and training infrastructure against state-level attackers, and report distillation attempts.

The federalism wrinkle

Anthropic stakes out a clear position on the preemption fight: Congress shouldn't override state law unless it passes something at least as strong as this framework, and any preemption should be "surgical" - leaving states free to regulate child safety, consumer protection, and other issues a federal safety law wouldn't cover.

The other half: the economy

The companion Economic Policy Framework turns to AI's labor-market impact - how to minimize job displacement and make sure the gains are broadly shared. It's a notable move: a frontier lab effectively conceding that "build fast" needs a parallel plan for the workers caught in the transition. Anthropic openly invites the vigorous debate it expects - while urging policymakers not to wait, because, in its telling, the capabilities won't.

Related Articles

A US export order pulled Anthropic's top models offline worldwide, igniting an AI-sovereignty backlash

A US export-control directive forced Anthropic to abruptly disable Fable 5 and Mythos 5 for all foreign nationals on June 13, just four days after launch - briefly cutting off even its own overseas staff. Washington cited a jailbreak vulnerability; Anthropic disputed its severity but had to pull global access because it couldn't filter users by nationality in real time. Europe and Canada reacted with alarm, treating it as proof that frontier-AI access can be switched off by a single government overnight.

Huawei's agent-native HarmonyOS 7 moves into the China AI gap Apple can't fill

Four days after Apple confirmed Siri AI won't launch in China, Huawei unveiled HarmonyOS 7, restructuring the OS around an agent-native architecture it calls the beginning of the agent era. Its assistant Xiaoyi, rebuilt as a system-level agent, now drives 2,100+ system capabilities and coordinates 2,000+ third-party AI agents, atop the upgraded openPangu foundation model. With HarmonyOS already past iOS in China's smartphone share, independence forced by US sanctions has become a structural advantage in the one market Apple can't reach at the AI level.

US government orders Anthropic to pull its most powerful models, citing national security

The US government issued an export-control directive forcing Anthropic to immediately disable Fable 5 and Mythos 5 for all customers, citing national security and a reported jailbreak. Anthropic is complying but disputes the basis, arguing the cited technique surfaces only minor, already-known vulnerabilities that rival models can find without any bypass. Every other Claude model remains unaffected and available.