Advanced AI News

Text-to-video for AI characters that speak

By Advanced AI Bot | April 5, 2025


ChatGPT’s ability to ignore copyright and common sense while creating images and deepfakes is the talk of the town right now. The image generator model OpenAI launched last week is so heavily used that it’s degrading ChatGPT’s basic functionality and uptime for everyone.

But it’s not just advancements in AI-generated images that we’ve witnessed recently. The Runway Gen-4 video model lets you create incredible clips from a single text prompt and a photo, maintaining character and scene continuity, unlike anything we have seen before.

The videos the company provided should put Hollywood on notice. Anyone can make movie-grade clips with tools like Runway’s, assuming they work as intended. At the very least, AI can help reduce the cost of special effects for certain movies.

It’s not just Runway’s new AI video tool that’s turning heads. Meta has an AI model of its own, MoCha, which can create talking AI characters in videos that might be good enough to fool you.


MoCha isn’t a coffee order spelled wrong. It’s short for Movie Character Animator, a research project from Meta and the University of Waterloo. The basic idea behind the MoCha AI model is simple: you provide the AI with a text prompt describing the video and a speech sample, and the AI generates a video in which the characters “speak” the lines from the audio sample almost perfectly.
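That two-input pipeline (a text prompt plus a driving speech sample) can be sketched as a tiny request stub. This is purely illustrative: MoCha has no public API, and none of these names or file formats come from Meta’s code.

```python
from dataclasses import dataclass

@dataclass
class GenerationRequest:
    """Hypothetical request shape mirroring MoCha's two conditioning inputs."""
    prompt: str              # text description of the scene and character
    speech_audio: str        # path to the speech sample the lips sync to
    num_characters: int = 1  # the demos also show multi-character scenes

def validate_request(req: GenerationRequest) -> bool:
    """Check that both inputs the article describes are present and plausible."""
    has_prompt = bool(req.prompt.strip())
    has_audio = req.speech_audio.lower().endswith((".wav", ".mp3", ".flac"))
    return has_prompt and has_audio and req.num_characters >= 1

req = GenerationRequest(
    prompt="A chef in a sunlit kitchen explains a recipe, smiling.",
    speech_audio="chef_line.wav",
)
print(validate_request(req))  # True: both conditioning inputs are present
```

The point of the sketch is simply that, unlike image generators, the model is conditioned on two modalities at once, and the audio track drives the timing of the output video.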

The researchers provided plenty of samples that show MoCha’s advanced capabilities, and the results are impressive. We have all sorts of clips showing live-action and animated protagonists speaking the lines from the audio sample. MoCha takes emotions into account, and the AI can also support multiple characters in the same scene.

The results are almost perfect, but not quite. There are some visible imperfections in the clips. The eye and face movements are giveaways that we’re looking at AI-generated video. Also, while the lip movement appears to be perfectly synchronized to the audio sample, the movement of the entire mouth is exaggerated compared to real people.

I say that as someone who has seen plenty of similar AI models from other companies by now, including some incredibly convincing ones.

First, there’s the Runway Gen-4 that we talked about a few days ago. The Gen-4 demo clips are better than MoCha’s. But Gen-4 is a product you can use today, while MoCha is still a research project; it can certainly improve by the time it becomes a commercial AI model.

Speaking of AI models you can’t use, I always compare new products that can sync AI-generated characters to audio samples to Microsoft’s VASA-1 AI research project, which we saw last April.

VASA-1 lets you turn static photos of real people into videos of speaking characters as long as you provide an audio sample of any kind. Understandably, Microsoft never made the VASA-1 model available to consumers, as such tech opens the door to abuse.

Finally, there’s TikTok’s parent company, ByteDance, which showed off a VASA-1-like model of its own, OmniHuman-1, a couple of months ago. It does the same thing, turning a single photo into a fully animated video.

OmniHuman-1 also animates body movements, something I saw in Meta’s MoCha demos as well. That’s how we got to see Taylor Swift sing the Naruto theme song in Japanese. Yes, it’s a deepfake clip; I’m getting to that.

Products like VASA-1, OmniHuman-1, MoCha, and probably Runway Gen-4 might be used to create deepfakes that can mislead.

Prompt examples for Meta’s MoCha AI video generator. Image source: arXiv

Meta researchers working on MoCha and similar projects should address this risk publicly if and when the model becomes commercially available.

You might spot inconsistencies in the MoCha samples available online, but watch those videos on a smartphone display and they might not be so evident. Set aside your familiarity with AI video generation, and you might think some of these MoCha clips were shot with real cameras.

Also important would be disclosure of the data Meta used to train this AI. The paper says MoCha was trained on some 500,000 samples, amounting to 300 hours of high-quality speech video, without saying where that data came from. Not acknowledging the source of training data is, unfortunately, a recurring theme in the industry, and it’s still a concerning one.

You’ll find the full MoCha research paper at this link.


