Gemini Omni is Google's any-modality-in, any-modality-out model, starting with video

May 19, 2026

Key Insights

Google announced Gemini Omni, a new model family capable of generating output in any modality from any input - combining Gemini's intelligence with Google's generative media models in what it frames as a leap in world understanding. The first release, Gemini Omni Flash, starts with video outputs (with image and text to follow) and is available today in the Gemini app, Google Flow, and YouTube Shorts. API access for developers and enterprises follows in the coming weeks.

Stay Updated

Get the latest insights delivered to your inbox

From predicting text to simulating reality

Google introduced Gemini Omni, a model designed to generate samples in any output modality from any input - part of a broader shift the company describes as AI moving from predicting text to simulating reality through world models.

What it does

- Omni combines Gemini's reasoning with Google's generative media models, which Google frames as a significant step forward in world understanding.
- The first model in the family, Gemini Omni Flash, starts with video outputs, with image and text generation to be enabled over time.
- It's available starting now in the Gemini app, Google Flow, and YouTube Shorts, with rollout to developers and enterprise customers via APIs in the coming weeks.

The bigger picture

Omni sits alongside Google's other world-simulation work shown at I/O - including Project Genie, which generates explorable real-world places - and reflects a strategic bet that unifying intelligence with generative media is the next frontier. Combined with the breakout success of Google's Nano Banana image models (more than 50 billion images generated to date), it underscores how central generative media has become to Google's roadmap. The natural question Omni raises, as with any high-quality generative video, is provenance - which is why Google paired its I/O media news with an expansion of SynthID watermarking and Content Credentials, now joined by partners including OpenAI, Kakao, and ElevenLabs.

Source: blog.google

An AWS knowledge-graph deployment turned 6-month research cycles into 3 weeks - and the blueprint transfers far beyond pharma

An AWS GraphRAG deployment in pharmaceutical research cut R&D cycles by 87% - initial discovery that took six months now closes in three weeks - by fusing siloed internal databases and public literature into one queryable knowledge graph on Amazon Neptune Analytics and Bedrock (running Claude). Every answer comes with verifiable citations and a mapped reasoning path, which is exactly what regulated industries need for compliance. The architecture is modular and, crucially, transferable: any enterprise drowning in fragmented legacy data can copy this pattern.

July 9, 2026

SpaceX, Anthropic, and OpenAI listings will out-value every US VC-backed exit since 2000 - reshaping vendor economics for everyone

The new NVCA-Pitchbook Venture Monitor dropped a stunning claim: the pending OpenAI and Anthropic IPOs, together with SpaceX's listing, will generate more value than every US VC-backed exit since 2000 combined. SpaceX is already public at $1.77 trillion, and with both AI labs pushing toward trillion-dollar debuts, the trio should land north of $4 trillion - against roughly $70 billion in total US IPO proceeds last year. For anyone buying AI services, the labs' shift to public-market scrutiny will reshape pricing, transparency, and vendor stability.

July 9, 2026

A 14-person open-source team just became the default way 8.9M developers run local AI - and a lever for slashing inference bills

Ollama, the open-source tool that lets developers run open-weight AI models on their own machines in minutes, raised a $65M Series B led by Theory Ventures ($88M total), revealing it now serves 8.9 million developers monthly and sits inside 85% of the Fortune 500 - with just 14 employees. Founders Jeff Morgan and Michael Chiang previously built Docker Desktop, and they're repeating the play: abstract away the hardware pain, then monetise a cloud tier priced on GPU time rather than tokens. The backdrop is the industry's loudest cost debate: every company with heavy inference bills is under existential pressure to shift routine workloads to open models.

July 9, 2026

Key Insights

Stay Updated

From predicting text to simulating reality

What it does

The bigger picture

Related Articles

An AWS knowledge-graph deployment turned 6-month research cycles into 3 weeks - and the blueprint transfers far beyond pharma

SpaceX, Anthropic, and OpenAI listings will out-value every US VC-backed exit since 2000 - reshaping vendor economics for everyone

A 14-person open-source team just became the default way 8.9M developers run local AI - and a lever for slashing inference bills