A couple of months ago, at its I/O developer conference, Google revealed its new media generation models and an announcement came in the form of Veo 3, which can generate videos with sounds, like birds chirping. Google is beginning to roll out Veo 3 to all countries where the Gemini App is available, including India, through the Google AI Pro subscription. In India, you can get AI Pro subscription free for a month and then it will cost Rs 1,950 per month.
Veo 3 allows users to create eight-second videos (with sound). The AI model can generate synthesised speech and scenes with background music.
In the past few weeks, thousands of clips with goofy cats and odd news reports have hit the Internet, some of which have been created with Veo 3.
“We’re entering a new era of creation,” said Google’s VP of Gemini, Josh Woodward, during the keynote. He said the output was “incredibly realistic”... and he hasn’t been off the mark.
Having audio in the video clips makes Veo 3 special. AI-powered sound-generating tools aren’t new but having synthesised audio is cutting-edge.
“All videos created from your photos display a visible watermark and are embedded with an invisible digital SynthID watermark, which indicates the videos are AI-generated,” said David Sharon, multimodal generation lead, Gemini Apps, in a blog post.
Video tools such as Veo 3 have the potential to change the entertainment industry in the long run. A 2024 study commissioned by the Animation Guild, estimated that more than 100,000 US-based film, television, and animation jobs will be disrupted by AI by 2026. There are a number of startups working in this field, like Runway, Genmo, Pika and Luma, besides tech giants like OpenAI and Alibaba.