Google DeepMind Unveils Veo 2: A New Video-Generation AI to Rival OpenAI's Sora

Google DeepMind, the AI research lab of Google, has announced Veo 2, its next-generation video-generation model and successor to its earlier version, Veo. Designed to outpace OpenAI’s Sora, Veo 2 boasts the ability to generate two-minute video clips in 4K resolution—four times the resolution and over six times the duration of what Sora can achieve.

While these specifications are impressive, they remain theoretical for now. Currently, Veo 2 is accessible only via Google’s experimental VideoFX tool, where its output is capped at 720p resolution and eight-second durations, compared to Sora’s 1080p and 20 seconds.

Expanded Access and Future Plans

VideoFX is available only to users on a waitlist, though Google plans to expand access this week. According to Eli Collins, VP of product at DeepMind, the company also intends to integrate Veo 2 into Vertex AI, Google’s developer platform, as the model becomes ready for broader use.

“Over the coming months, we’ll continue iterating based on user feedback and look for compelling use cases across Google’s ecosystem,” Collins told TechCrunch. Updates are expected in 2025.

Improved Capabilities

Like its predecessor, Veo 2 generates videos based on text prompts (e.g., “A car racing down a freeway”) or a combination of text and reference images. However, the new model introduces several enhancements:

Sharper Textures and Images: Clips are clearer and handle scenes with significant motion more effectively.
Advanced Camera Controls: Veo 2 enables precise positioning of the virtual “camera” and dynamic movement, allowing for diverse angles and perspectives.
Realistic Motion and Effects: The model simulates fluid dynamics (e.g., coffee pouring) and lighting effects (e.g., reflections and shadows) with improved realism. It also supports cinematic effects and nuanced human expressions.

DeepMind shared sample videos, which highlighted Veo 2’s strengths, including realistic liquids like syrup and Pixar-style animations. However, challenges persist, such as lifeless eyes in characters and inconsistencies in complex scenes, like pedestrians blending into backgrounds or physically implausible building facades.

Addressing Limitations

Collins acknowledged the areas needing improvement:

Coherence and Consistency: Adherence to complex prompts over extended durations remains a challenge.
Detail and Realism: Fast motions and intricate details require further refinement.
Character Consistency: Maintaining character traits across frames is still under development.

DeepMind is working with artists, including Donald Glover and The Weeknd, to refine the model and ensure it aligns with creative workflows.

Training and Ethical Concerns

Veo 2 was trained on a vast dataset of video-description pairs, though DeepMind has not disclosed specific sources. YouTube content, given Google’s ownership, is a likely contributor.

While Google provides tools for webmasters to block data scraping, DeepMind does not currently allow creators to remove their works from training datasets. The company maintains that training on public data constitutes fair use, a stance contested by some artists and filmmakers.

DeepMind has implemented safeguards, including filters for explicit or violent content and SynthID watermarking to prevent misuse, though no watermarking technology is foolproof.

Updates to Imagen

Alongside Veo 2, DeepMind announced enhancements to Imagen 3, its commercial image-generation model. The updated version, rolling out to users of Google’s ImageFX, creates more vivid, detailed images in styles like photorealism, impressionism, and anime.

A new chiplet UI feature in ImageFX will allow users to refine prompts with suggested descriptors, improving usability and creative control.

Looking Ahead

With Veo 2 and Imagen 3, DeepMind is strengthening its position in generative AI, competing with OpenAI and other rivals. As Veo 2 integrates into Google’s platforms and matures, it could play a central role in reshaping video generation and creative industries, though ethical and technical hurdles remain to be addressed.

Technology

Supreme Court to Hear TikTok’s Challenge Against Federal Ban

ByHaider Shahzad 18/12/202419/12/2024

WASHINGTON — The Supreme Court announced Wednesday that it will take up TikTok’s appeal against a federal law that could ban the app in the United States starting January 19. The court’s decision to hear the case comes just a day after TikTok filed its appeal, with oral arguments scheduled for January 10. The law…

Technology

WhatsApp Business Beta Update Introduces AI-Powered Replies and New Theme Colors

ByHaider Shahzad 19/12/202421/12/2024

WhatsApp Business has rolled out a new beta update for Android, version 2.24.26.16, through the Google Play Beta Program. This latest release brings several enhancements, including AI-powered replies, business platform integration, and refreshed theme colors designed to improve user experience. Updated Theme Colors The update introduces updated light and dark themes, replacing the old light…

Technology

OpenAI Launches 1-800-CHATGPT for Phone Calls and Texts

ByHaider Shahzad 19/12/202421/12/2024

OpenAI has introduced a new way for users to interact with its popular AI chatbot, ChatGPT. U.S. users can now call 1-800-CHATGPT to access the service, while users worldwide can message the same number via WhatsApp. Key Features Part of a Broader Expansion This new feature is part of OpenAI’s ongoing efforts to make ChatGPT…

Business | Technology

Intel Scores GPU Win: Arc B580 Sells Out After Stellar Reviews

ByHaider Shahzad 23/12/202423/12/2024

A Bright Spot in a Tough Year for Intel Intel’s 2023 has been challenging, but the launch of the Arc B580 “Battlemage” GPU is proving to be a much-needed success. Priced at $250, this discrete graphics card has received rave reviews, with demand so high that it’s already sold out at many retailers. Intel has…

Technology | Science

KAIST Develops Revolutionary Exoskeleton to Help Paraplegics Walk

ByHaider Shahzad 24/12/202424/12/2024

WalkON Suit F1: A Game-Changer in Assistive Technology Researchers at the Korea Advanced Institute of Science and Technology (KAIST) have unveiled an innovative wearable robot, the WalkON Suit F1, designed to empower paraplegic individuals to walk, navigate obstacles, and climb stairs. The lightweight exoskeleton, created by KAIST’s Exoskeleton Laboratory, combines cutting-edge robotics with advanced sensory…

Latest | Technology

Iran Votes to Lift Ban on WhatsApp

ByHaider Shahzad 26/12/202426/12/2024

Supreme Council of Cyberspace Approves Unblocking Iran’s Supreme Council of Cyberspace, the nation’s top internet regulatory body, has voted unanimously to lift the ban on WhatsApp, according to state media reports. The popular messaging app had been restricted in Iran for over two years. Along with WhatsApp, the council also decided to remove restrictions on…

Expanded Access and Future Plans

Improved Capabilities

Addressing Limitations

Training and Ethical Concerns

Updates to Imagen

Looking Ahead

Related Posts