Google、画像生成Imagen 4、動画生成Veo 3、AI映像制作ツールFlowをリリース〜電子透かしSynthIDの識別を支援する「SynthID Detector」の早期アクセスも案内

Googleは5月20日から開催されているGoogle I/O 2025で、画像生成モデルVeoと動画生成モデルImagenをアップデートしたことを発表した。また、音楽生成モデルLyriaへのアクセスを先月4月に拡大したことを案内し、さらにAI映像制作ツールFlowをあらたにリリースしたことも発表した。

動画生成モデル「Veo 3」

動画生成モデルVeo 3は、Veo 2から品質を向上させ、あらたに音声付きの動画を生成できるようになった。都市の雑踏や公園の鳥のさえずり、キャラクター同士の会話など、多様な音声を動画に追加できるという。

また、テキストや画像を使ったプロンプトから、現実の物理法則や音声にあわせた口の動きの同期も可能とのこと。

現在Veo 3は、アメリカのGeminiアプリおよびFlowでUltraプランのユーザー、またはVertex AIのエンタープライズ向けで利用可能。

Video, meet audio. 🎥🤝🔊

With Veo 3, our new state-of-the-art generative video model, you can add soundtracks to clips you make.

Create talking characters, include sound effects, and more while developing videos in a range of cinematic styles. 🧵 pic.twitter.com/5Hfpetfg8b
— Google DeepMind (@GoogleDeepMind) May 20, 2025

なおVeo 2にも新機能を追加したことを発表した。

画像をもとにキャラクターやシーン、オブジェクト、スタイルを指定できる「参照画像による動画生成」
カメラの回転・移動・ズームを設定できる「カメラコントロール」
映像のフレームを拡張する「アウトペインティング」
相互作用が考慮された「オブジェクトの追加と削除」

現在、参照画像による動画生成機能とカメラコントロール機能がFlowで利用できる。

AI映像制作ツール「Flow」

AI映像制作ツール「Flow」は、Veo、Imagen、Geminiといったモデルを活用し、自然言語で映像のシーンやストーリーを記述できるAI映像制作ツール。キャストやロケーション、オブジェクト、スタイルなどを一元管理して映像を制作できるという。

現在Flowは、アメリカのGoogle AI ProとUltraプランのユーザーが利用可能。

Get into the zone with Flow. 🎬

It combines the best of our most advanced models Veo, Imagen and Gemini into 1️⃣ master filmmaking tool - helping you weave cinematic clips, dynamic scenes, and compelling narratives into stories with consistent results. pic.twitter.com/E2a7NJNuP9
— Google DeepMind (@GoogleDeepMind) May 20, 2025

画像生成モデル「Imagen 4」

画像生成モデル「Imagen 4」は、これまで以上にスピードと精度を両立している。布地や水滴、動物の毛並みなどの細かな表現や、写実的・抽象的なスタイルにも対応したという。最大2Kの解像度の画像を生成できるため、印刷やプレゼンテーションにも最適とのこと。

また、文字描画やタイポグラフィの精度も向上し、グリーティングカードやポスター、漫画の制作にも活用できるという。

現在Imagen 4は、Geminiアプリ、Whisk、Vertex AI、Workspaceの各種サービスで利用可能。

今後、Imagen 3の10倍の速度で処理できる「Imagen 4の高速バージョン」もリリース予定とのこと。

Get ready for Imagen 4 🎨 capable of creating richer images, with more nuanced colors, intricate details and superior typography.

Tap each photo below to see more. 👀 pic.twitter.com/W0vDYu4Z4R
— Google DeepMind (@GoogleDeepMind) May 20, 2025

音楽生成モデル「Lyria 2」

音楽生成モデル「Lyria 2」は、ミュージシャンやプロデューサー、作曲家向けの実験的ツール「Music AI Sandbox」に搭載されているが、このアクセスを4月に拡大している。

現在Lyria 2 は、クリエイター向けにYouTubeショートを通じた提供や、Vertex AIでエンタープライズ向けで利用可能。

MusicFX DJで使われているインタラクティブな音楽生成モデル「Lyria RealTime」が、API経由やAI Studioで利用できる。

We recently expanded access to Music AI Sandbox 🎵 our suite of experimental AI tools for professional musicians powered by our latest model Lyria 2.

🎧 This medley made with artists will transport you through different genres, sounds, and moods. Sound on ↓ pic.twitter.com/3ns1X5ouH8
— Google DeepMind (@GoogleDeepMind) May 20, 2025

生成されたコンテンツに埋め込まれる電子透かし「SynthID」

2023年に導入された電子透かし「SynthID」が、100億以上の画像・動画・音声・テキストのAI生成コンテンツに埋め込まれてきたことも紹介し、Veo 3、Imagen 4、Lyria 2の出力にも引き続きSynthIDが適用されるとした。

今回、AI 生成コンテンツの識別を支援する「SynthID Detector」も新たに発表した。コンテンツをアップロードすると、SynthID Detectorによってファイル全体、または一部にSynthIDが含まれているかの識別ができる。現在、テスター向けに早期アクセスを提供している。

SynthID Detector can quickly identify SynthID watermarks in audio and images – even after edits like cropping and adding filters. ✂️

The tool can even show where in the content it thinks AI has been used. 🔍

Soon, you'll be able to detect AI-generated content in text and video… pic.twitter.com/hGHmsbOWIU
— Google DeepMind (@GoogleDeepMind) May 20, 2025