Google has introduced Veo 3.1, its latest video model, featuring enhanced audio capabilities, more precise editing options, and improved image-to-video results. According to Google, Veo 3.1 is an upgrade over Veo 3, which was released in May, and it generates more lifelike video and adheres more closely to user instructions.
Google noted that the model lets users insert objects into videos, with the added objects matching the style of the original footage. Additionally, an upcoming update will let users remove objects from videos within Flow.
Veo 3 already offered editing features such as using reference images to animate characters, generating clips from specified start and end frames, and extending videos based on recent frames. With Veo 3.1, Google has added audio support to all of these functions, making the generated clips more dynamic.
The updated model is rolling out to Google’s video editor Flow and the Gemini app, and it is also available through the Vertex AI and Gemini APIs. Google shared that since Flow debuted in May, users have produced over 275 million videos using the app.