Vivold Consulting

Google's Gemini 3.5 Flash pairs frontier-level intelligence with speed at under half the price of comparable frontier models

Key Insights

Google introduced Gemini 3.5 Flash, the first in a model series combining frontier intelligence with agentic action - beating the prior 3.1 Pro on nearly all benchmarks, with a big jump on the real-world GDPVal task suite. It runs about 4x faster than other frontier models at under half the price, and is available across Google's products and APIs today. A more capable Gemini 3.5 Pro is due the following month.


A frontier model tuned for speed and cost

At I/O 2026, Google introduced Gemini 3.5 Flash, the first in a new series of models built to combine frontier-level intelligence with the ability to take action. The pitch is that you no longer have to trade capability for speed or cost.

What's new

- Against the previous 3.1 Pro, the new Flash is better across almost all benchmarks, with particularly large gains in coding and a striking jump on GDPVal, a benchmark meant to capture real-world, economically valuable tasks.
- On the intelligence-versus-speed tradeoff, Google places it in a class of its own: by the company's own measure, it generates output tokens roughly 4x faster than other frontier models while remaining comparable to the best on quality.
- It delivers those capabilities at less than half the price of comparable frontier models, which Google frames as a major lever for companies burning through token budgets.

Why it matters

Google leaned hard on the economics, claiming that a company processing around a trillion tokens a day could save over $1 billion annually by shifting roughly 80% of its workloads from other frontier models to 3.5 Flash. The model is also already woven into Google's own development: the company says its internal AI dev tools now process more than 3 trillion tokens a day, a sixfold increase from half a trillion in March, and credits that usage with creating a feedback loop that improved 3.5. Gemini 3.5 Flash is available today across Google's products and APIs, with a more capable Gemini 3.5 Pro promised the following month.
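To put the savings claim in per-token terms, here is a rough back-of-the-envelope sketch. It uses only the round figures quoted above (about a trillion tokens a day, roughly 80% of workloads shifted, over $1 billion saved per year) and assumes a 365-day year and a single blended rate across input and output tokens. The function name and parameters are illustrative, and the result is an implied price gap, not a published Google price.

```python
def implied_saving_per_million_tokens(
    tokens_per_day: float,
    shifted_fraction: float,
    annual_saving_usd: float,
    days_per_year: int = 365,  # assumption: usage is constant all year
) -> float:
    """Return the per-million-token saving implied by a headline annual figure."""
    # Tokens moved to the cheaper model over a full year.
    shifted_tokens_per_year = tokens_per_day * shifted_fraction * days_per_year
    # Express that volume in millions of tokens, the usual pricing unit.
    shifted_millions = shifted_tokens_per_year / 1_000_000
    return annual_saving_usd / shifted_millions


# Figures quoted in the article, treated as exact for the estimate.
saving = implied_saving_per_million_tokens(
    tokens_per_day=1_000_000_000_000,  # "around a trillion tokens a day"
    shifted_fraction=0.80,             # "roughly 80% of its workloads"
    annual_saving_usd=1_000_000_000,   # "over $1 billion annually"
)
print(f"Implied saving: ${saving:.2f} per million tokens")
```

On these inputs the claim works out to a blended gap of about $3.42 per million tokens shifted. Because the source says "over" $1 billion, that figure is best read as a floor on the implied difference rather than a precise number.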
