Introducing Gemini Omni

Gemini May 19, 2026

Google is introducing Gemini Omni, a natively multimodal model that creates and edits high-quality videos using text, image, audio, and video inputs.

Summary

Google has announced Gemini Omni, a new natively multimodal AI model designed to bridge the gap between reasoning and creative production. The model lets users generate and edit high-quality videos by combining text, image, audio, and video inputs. A key feature is complex video editing through natural-language instructions, with the model maintaining scene consistency and applying realistic physics. The first release, Gemini Omni Flash, is currently available to Google AI subscribers through the Gemini app and Google Flow, and to users of YouTube Shorts and YouTube Create; Google plans to expand access to developers soon. Google also emphasizes responsible AI, applying SynthID watermarking to generated content for transparency.

(Source: Gemini)

