Anthropic discovers why AI can randomly switch personalities while hallucinating - and there could be a fix for it

August 18, 2025

Key Insights

Anthropic identifies 'persona vectors'—specific neural patterns influencing AI behavior and character traits. These vectors can cause AI models to unpredictably change tone or adopt bizarre personas, leading to hallucinations. Understanding these vectors may help detect and curb unwanted behavioral deviations.

Stay Updated

Get the latest insights delivered to your inbox

Tackling AI's Unpredictable Behavior

Anthropic has identified 'persona vectors' within AI neural networks, which influence behavior and character traits, akin to human moods. These vectors can lead to unexpected tone shifts or hallucinations in AI responses.

Key Insights:

- Behavioral Influence: Persona vectors can cause AI models to unpredictably change tone or adopt bizarre personas, leading to hallucinations.

- Potential Solutions: Understanding these vectors may allow researchers to detect and curb unwanted behavioral deviations during conversations or training phases.

Business Considerations:

- AI Reliability: Addressing persona vectors is crucial for ensuring consistent and reliable AI interactions, particularly in customer-facing applications.

- Research Investment: Investing in understanding and mitigating such AI behaviors can enhance product trustworthiness and user satisfaction.

- Competitive Edge: Proactively managing AI behavior can differentiate your offerings in a market increasingly concerned with AI reliability.

Source: tomsguide.com

L'Oreal's OpenAI deal puts Maybelline try-on, product discovery, and ChatGPT ads in play

L'Oreal has announced a wide-ranging collaboration with OpenAI, unveiled at VivaTech 2026, that brings Maybelline's virtual makeup try-on directly into ChatGPT via L'Oreal's ModiFace AR technology. The deal spans consumer shopping tools, product discovery for brands like Lancome and Kerastase, advertising pilots (SkinCeuticals, CeraVe, Garnier), and R&D - including using OpenAI's GPT-Rosalind life-sciences model for skin-microbiome research. It lands as OpenAI reports ChatGPT at more than 900 million weekly users.

June 22, 2026

Sakana's Fugu delivers multi-agent frontier performance through one API - and pitches it as an export-control hedge

Sakana AI has launched Fugu and Fugu Ultra, a multi-agent orchestration system delivered as a single foundation model - Fugu is itself an LLM trained to route tasks across a swappable pool of the world's best models (and recursively to itself) via one OpenAI-compatible API. Sakana says Fugu Ultra matches frontier models like Anthropic's Fable 5 and Mythos Preview on demanding engineering, science, and reasoning benchmarks, while pitching the approach as an AI-sovereignty hedge: if one provider's access disappears, as with Anthropic's recently export-controlled models, Fugu reroutes around it. It is generally available today through subscription and pay-as-you-go tiers.