Generated Video, Images, and Music. One API.

For a while, building generative media features meant assembling a small coalition of vendors. One for images. Another for video. A third for audio. Each with its own API contract, rate limits, billing relationship, and compliance paperwork. Getting them to work together was your problem.

Google has been quietly closing that gap. The Vertex AI Generative Media Suite now includes Veo 3 for video, Imagen 4 for images, and Lyria 3 Pro for music, all accessible from the same platform, under the same enterprise governance umbrella. That is a genuinely new situation for anyone building content-heavy products.

Veo 3: The Part That Actually Surprised People

Video generation has existed for a while now, but every major model had the same limitation: it produced silent video. You got the visuals. The audio was your problem. Runway, Sora (discontinued April 2026), most of the field, silent clips that needed a separate post-production audio step before they were usable in anything real.

Veo 3 changed that. It generates synchronized audio natively alongside the video: dialogue, sound effects, ambient noise, background music, all produced in the same API call. A single prompt generates a video where the character’s footsteps match their movement, the background music fits the scene, and any dialogue is lip-synced. Veo 3 went generally available on Vertex AI in July 2025 and generates up to 60 seconds at 1080p HD. A faster, cost-optimized variant called Veo 3 Fast handles high-volume use cases where top-end quality is not required for every clip.

Imagen 4: Images That Actually Render Text Correctly

Anyone who has spent time with AI image generation knows the text problem. Ask a model to put words in an image and you get something that looks roughly like letters if you squint at it charitably. Imagen 4 fixes this. Text and typography render accurately, which sounds like a small thing until you realize how many real product use cases involve putting words in an image.

The model supports resolutions up to 2K (2048×2048 pixels) and comes in three tiers: Imagen 4 Fast for rapid iteration and high-volume generation (up to 10x faster than the standard model), Imagen 4 for the flagship balance of quality and speed, and Imagen 4 Ultra for the highest fidelity with strictest prompt adherence. Imagen 4 went generally available in August 2025. Every output carries a SynthID watermark for provenance tracking.

Lyria 3 Pro: Full Songs With Structure You Actually Control

Music generation is the newest piece. Lyria 3 Pro entered public preview on Vertex AI in March 2026 and generates full songs up to three minutes long. The interesting part is structural control: you can specify intros, verses, choruses, and bridges explicitly, and the model follows that blueprint. Most generative music tools give you a song and you take what you get. Lyria 3 Pro gives you a song that fits the shape you describe.

It supports text and image inputs, generating music that matches the mood and content of what you feed it. Vocals, lyrics, and instruments across multiple languages. Like the rest of the suite, every output gets a SynthID watermark.

Why the Suite Matters More Than the Sum of Its Parts

Each of these models is worth knowing about individually. Together, they represent something more useful: a content production stack that does not require managing three vendor relationships, three billing accounts, and three compliance reviews.

Enterprise customers increasingly ask about AI content provenance. SynthID watermarking is built into all three modalities, not bolted on. The governance story is the same across video, images, and music: Vertex AI IAM, VPC Service Controls, Cloud Logging. One platform, one compliance posture.

For ISVs building marketing platforms, e-commerce tools, learning management systems, or entertainment products, the implication is that content generation features that required assembling multiple vendor APIs a year ago can now be built on a single platform with enterprise-grade controls. Whether that changes what you ship next quarter is worth thinking through.

Want to go deeper? Here are a few links worth your time