Google is launching the most recent iteration of Imagen 3 today, together with a new video model called Veo 2, both of which produce cutting-edge outcomes. These models are currently accessible in ImageFX, VideoFX, and Whisk, its most recent lab experiment.
Veo 2: Video production
Google is announced that developers may now incorporate Veo 2, Its cutting-edge video creation technology, into their apps. Before beginning to develop a premium tier in the Gemini API, you may test out its features in Google AI Studio.
Veo 2, which is intended to create realistic, high-resolution films with cinematic realism, is a significant advancement in video creation. It provides smooth character movement, realistic settings, and finer visual details across a variety of themes and genres by better understanding human motion and real-world physics. Veo 2 produces videos of exceptionally high quality covering a variety of topics and aesthetics. Veo 2 outperformed the most advanced models in head-to-head comparisons evaluated by human raters.
Its detail and overall realism are enhanced by its deeper comprehension of real-world physics and the subtleties of human movement and emotion. Ask Veo 2 for a genre, a lens, and some cinematic effects, and it will provide them at up to 4K resolutions and up to minutes in length. Veo 2 is aware of the special language of cinematography. Veo 2 can create the wide angle shot that this lens is known for if you enter “18mm lens” in your prompt. Alternatively, you may enter “shallow depth of field” to blur away the background and concentrate on your subject.
Veo 2 has been led by its dedication to responsible development and safety. In order to help identify, comprehend, and enhance the model’s quality and safety while gradually releasing it via VideoFX, YouTube, and Vertex AI, it has purposefully measured the expansion of Veo’s availability.
How to create videos with Gemini
Choose Veo 2 from Gemini’s model dropdown to start creating videos. This function produces a 720p, eight-second video clip that is saved as an MP4 file in a 16:9 landscape orientation. It will let you know when you’re getting close to the monthly cap on the number of videos you may produce.
Gemini makes it easy to produce films; all you have to do is describe the scene you want to create, whether it a short tale, a visual concept, or a particular scene, and Gemini will make your thoughts come to life. You have more control over the finished video the more thorough your description is. This creates a world of exciting creative possibilities where you may rapidly describe brief visual ideas, explore a variety of visual styles from realism to fantasy, and imagine unbelievable combinations.
Sharing what you’ve created with others is one of the nicest aspects. It’s simple to share your video on a mobile device: just hit the share button to submit interesting short movies to YouTube Shorts and TikTok in no time.
Imagen 3: cutting-edge image production
Additionally, Google’s Imagen 3 image-generation technology has been enhanced to produce images that are more well-composed and brighter. It may now more accurately depict a wider range of artistic forms, including impressionism, photorealism, abstract art, and anime. Additionally, this update generates richer textures and details and more faithfully follows commands. Imagen 3 produced state-of-the-art outcomes when human rater outputs were compared side by side with outputs from top image-generating models.

The most recent Imagen 3 model will be made available in more than 100 countries worldwide starting today in ImageFX, its picture production tool from Google Labs. To begin, go to ImageFX.
Whisk
Whisk is a fun new application that allows you to visualise your ideas by prompting you with visuals.
With the newest project from Google Labs, Whisk, you can generate or input photos that represent the scene, subject, and style you want. After that, you can combine them and rework them to make something entirely original, like an enamel pin, sticker, or digital plush toy.
Under the hood, Whisk integrates Gemini’s visual understanding and description skills with the most recent Imagen 3 model. The Gemini model automatically generates a thorough caption for each of your photos, which it then feeds into Imagen 3. You may quickly and easily combine your subjects, situations, and styles in exciting new ways with this approach.
Right now establishes the U.S. launch of Whisk.
Whisk Animate
Google has introduced the Whisk Animate, Using both text and picture cues, the Google Labs project Whisk facilitates rapid exploration and visualisation of novel concepts. Whisk Animate allows you to animate your works .
Using Veo 2, Whisk Animate enables you to create vibrant eight-second films from your photos. Google One AI Premium users worldwide may get it as of right now.