Anthropic, DOE team up to spot dangerous nuclear chats

August 21, 2025

Key Insights

Anthropic partners with the U.S. Department of Energy's NNSA to develop a tool that identifies potentially harmful nuclear weapons-related discussions. The tool achieved a 94.8% success rate in detecting such queries.

Stay Updated

Get the latest insights delivered to your inbox

In a significant move to enhance AI safety, Anthropic has joined forces with the U.S. Department of Energy's National Nuclear Security Administration (NNSA) to create a classifier capable of distinguishing between legitimate scientific inquiries and potentially dangerous conversations about nuclear weapons. This collaboration, which has been ongoing for over a year, aims to ensure the safe deployment of Anthropic's AI model, Claude, in sensitive environments.

Why this matters:

- High detection accuracy: The tool demonstrated a 94.8% success rate in identifying nuclear weapons-related queries, showcasing its effectiveness.

- Minimal false negatives: Only 5.2% of harmful queries were mistakenly classified as benign, indicating a robust safety mechanism.

- Setting industry standards: Anthropic plans to share its approach through the Frontier Model Forum, potentially influencing sector-wide adoption of similar safety measures.

This development underscores the growing collaboration between AI companies and government agencies to address national security concerns, highlighting the importance of proactive safety measures in AI deployment.

Source: axios.com

L'Oreal's OpenAI deal puts Maybelline try-on, product discovery, and ChatGPT ads in play

L'Oreal has announced a wide-ranging collaboration with OpenAI, unveiled at VivaTech 2026, that brings Maybelline's virtual makeup try-on directly into ChatGPT via L'Oreal's ModiFace AR technology. The deal spans consumer shopping tools, product discovery for brands like Lancome and Kerastase, advertising pilots (SkinCeuticals, CeraVe, Garnier), and R&D - including using OpenAI's GPT-Rosalind life-sciences model for skin-microbiome research. It lands as OpenAI reports ChatGPT at more than 900 million weekly users.

June 22, 2026

Sakana's Fugu delivers multi-agent frontier performance through one API - and pitches it as an export-control hedge

Sakana AI has launched Fugu and Fugu Ultra, a multi-agent orchestration system delivered as a single foundation model - Fugu is itself an LLM trained to route tasks across a swappable pool of the world's best models (and recursively to itself) via one OpenAI-compatible API. Sakana says Fugu Ultra matches frontier models like Anthropic's Fable 5 and Mythos Preview on demanding engineering, science, and reasoning benchmarks, while pitching the approach as an AI-sovereignty hedge: if one provider's access disappears, as with Anthropic's recently export-controlled models, Fugu reroutes around it. It is generally available today through subscription and pay-as-you-go tiers.

June 22, 2026

HSBC's multi-year Google Cloud deal targets 200+ AI use cases, some worth $100M+ each

HSBC has signed a multi-year partnership with Google Cloud to build and deploy AI across wealth management, financial-crime risk, and internal decision support, using Gemini models and the Gemini Enterprise Agent Platform. The bank expects more than 200 AI use cases over two years, with selected ones each potentially returning over US$100 million. It builds on a deep existing base - 600-plus AI use cases and a Google-built financial-crime system screening 1.2 billion transactions a month.

June 18, 2026

Key Insights

Stay Updated

Related Articles

L'Oreal's OpenAI deal puts Maybelline try-on, product discovery, and ChatGPT ads in play

Sakana's Fugu delivers multi-agent frontier performance through one API - and pitches it as an export-control hedge

HSBC's multi-year Google Cloud deal targets 200+ AI use cases, some worth $100M+ each