Vivold Consulting

Sakana's Fugu delivers multi-agent frontier performance through one API - and pitches it as an export-control hedge

Key Insights

Sakana AI has launched Fugu and Fugu Ultra, a multi-agent orchestration system delivered as a single foundation model - Fugu is itself an LLM trained to route tasks across a swappable pool of the world's best models (and recursively to itself) via one OpenAI-compatible API. Sakana says Fugu Ultra matches frontier models like Anthropic's Fable 5 and Mythos Preview on demanding engineering, science, and reasoning benchmarks, while pitching the approach as an AI-sovereignty hedge: if one provider's access disappears, as with Anthropic's recently export-controlled models, Fugu reroutes around it. It is generally available today through subscription and pay-as-you-go tiers.

Stay Updated

Get the latest insights delivered to your inbox

A model whose job is to run other models

Sakana AI has released Sakana Fugu, a product built on an unusual premise: instead of one more giant monolithic model, the headline release is a model whose main skill is orchestrating other models. Fugu presents a full multi-agent system as a single foundation model - you call one API endpoint, and behind it Fugu decides whether to answer directly or assemble and coordinate a team of expert models to handle a complex, multi-step task. Sakana's framing is that the next frontier isn't bigger models but better coordination of collective intelligence: knowing which model to use, delegating planning and execution, and routing around any single model's weaknesses.

How it actually works

The trick is that Fugu is itself a language model, specifically trained to understand when to delegate, how agents should communicate, and how to merge their outputs into one reliable answer - it can even call instances of itself recursively. It handles model selection, delegation, verification, and synthesis internally, so the messiness of a multi-agent setup never reaches your code. The approach builds on Sakana's published research, including two ICLR 2026 papers - Trinity, an evolved LLM coordinator, and Conductor, on orchestrating agents in natural language - and, crucially, the underlying pool of models is swappable rather than fixed.

Two tiers, one API

At launch there are two models, both reachable through a single OpenAI-compatible API. Fugu trades a little quality for low latency and is positioned as the everyday default, dropping into coding and code-review tools like Codex, chatbots, and other interactive services, with the option to exclude specific agents from the pool for data, privacy, or compliance reasons. Fugu Ultra is tuned for maximum answer quality on hard, long-horizon problems, marshalling a deeper bench of expert agents; Sakana says early users leaned on it for AI research, paper reproduction, cybersecurity analysis, and literature and patent investigations.

The benchmark claim, with caveats

Sakana positions Fugu Ultra as shoulder-to-shoulder with Anthropic's Fable 5 and Mythos Preview across rigorous engineering, scientific, and reasoning benchmarks, and says its Fugu models outperform Gemini 3.1 Pro, Claude Opus 4.8, and GPT-5.5 on a grab-bag of tasks ranging from automated research to mechanical design, one-shot chess, and financial time-series prediction. Two caveats are worth flagging: the comparison numbers are self-reported (with the full set in a technical report on GitHub) and the baseline figures come from the model providers themselves, and Fable 5 and Mythos Preview aren't actually in Fugu's agent pool because they aren't publicly accessible - which is rather the point Sakana is making.

The real pitch: sovereignty

What sets the announcement apart is its explicit political framing. Sakana argues that relying on a single company's API for critical infrastructure, finance, or governance is now a material vulnerability rather than a hypothetical one, pointing directly at the recent export controls that pulled Anthropic's Fable and Mythos models offline. Orchestration, in this telling, is the practical hedge: because Fugu's agent pool is swappable, if one provider restricts access the system dynamically reroutes around the disruption, and the pool can absorb newer and cheaper models - including Sakana's own - over time. The company pitches this as a realistic blueprint for AI sovereignty: frontier capability without betting your stack on access that a single jurisdiction can revoke overnight.

Availability

Sakana Fugu is generally available now, following a beta with close to 500 early users, with subscription tiers for everyday use and pay-as-you-go pricing for heavier and enterprise workloads. Sakana frames this as a starting point rather than a finish line: it plans to expand the pool of expert agents - including open models and its own - and give users more control over how Fugu delegates on their behalf.

Related Articles

L'Oreal's OpenAI deal puts Maybelline try-on, product discovery, and ChatGPT ads in play

L'Oreal has announced a wide-ranging collaboration with OpenAI, unveiled at VivaTech 2026, that brings Maybelline's virtual makeup try-on directly into ChatGPT via L'Oreal's ModiFace AR technology. The deal spans consumer shopping tools, product discovery for brands like Lancome and Kerastase, advertising pilots (SkinCeuticals, CeraVe, Garnier), and R&D - including using OpenAI's GPT-Rosalind life-sciences model for skin-microbiome research. It lands as OpenAI reports ChatGPT at more than 900 million weekly users.

HSBC's multi-year Google Cloud deal targets 200+ AI use cases, some worth $100M+ each

HSBC has signed a multi-year partnership with Google Cloud to build and deploy AI across wealth management, financial-crime risk, and internal decision support, using Gemini models and the Gemini Enterprise Agent Platform. The bank expects more than 200 AI use cases over two years, with selected ones each potentially returning over US$100 million. It builds on a deep existing base - 600-plus AI use cases and a Google-built financial-crime system screening 1.2 billion transactions a month.

Microsoft has become the lone conduit for OpenAI's models in China, profiting where their makers won't go

Microsoft has quietly become the main supplier of OpenAI's GPT models in China, selling them through Azure to the country's biggest internet firms even though OpenAI and Anthropic both refuse to sell there directly on IP and misuse grounds. Per Bloomberg, ByteDance is Microsoft's largest AI customer and is on track to spend over US$1 billion a year. A unique OpenAI contract lets Microsoft set its own overseas terms - and it's simultaneously testing a Chinese DeepSeek model for Western customers, taking margin on both sides of the trade.