NewsTechnologyArtificial IntelligenceBaidu’s AI video model MuseSteamer is here: 5 wild video samples
The MuseSteamer is a vision language model that is capable of generating a 10-second video in 1080p resolution.
Baidu’s MuseSteamer is an enterprise-focused AI video tool designed for generating high-quality, synchronized media content. (Image: Baidu)
After Google’s Veo 3 and OpenAI’s Sora showed the world how AI can generate hyperrealistic videos from text prompts, Chinese AI companies seem to be catching up. Chinese search giant Baidu recently launched its first video generation model, MuseSteamer. This model is the first AI video generation tool that generates videos with synchronised Chinese audio.
The model allows users to generate visuals, sound effects, and spoken Chinese dialogue simultaneously. Reportedly, this is beneficial for advertisers, marketers, and anyone who wants to make high-quality videos without spending millions in production costs or working through extended timelines. The MuseSteamer is essentially a business-only AI tool which turns images into short videos. Baidu has also upgraded its search offerings by making them smarter, multimodal, and more personalised.
You have exhausted your
monthly limit of free stories.
Read more stories for free
with an Express account.
Do you really want to read this story? Become a subscriber now.
This premium article is free for now.
Register to continue reading this story
Do you really want to read this story? Become a subscriber now.
This content is exclusive for our subscribers.
Subscribe now to get unlimited access to The Indian Express exclusive and premium stories.
MuseSteamer is a Vision Language Model (VLM), which is a type of AI model that comes with the combined capabilities of computer vision and natural language processing. VLMs allow machines to understand and process information through images and texts, and they also let them perform tasks that require the combined understanding of visual and text data.
MuseSteamer is capable of creating 10-second clips in 1080p resolution with fully synced visuals, spoken dialogue and sound effects. Those who got to try Baidu’s MuseSteamer seem to be raving about the outputs of the model. Here are some stunning video samples shared by X users.
The AI model is available in three tiers – Turbo, Pro, and Lite – which is focused on enterprise users. While Veo 3 and OpenAI’s Sora are consumer-centric video models, MuseSteamer has been designed for businesses. The latest advancement from Baidu has intensified the generative AI race in China, where players like ByteDance, Tencent, Alibaba, etc., are already making rapid strides.
In May, at the Google I/O, the Alphabet Inc. company had introduced its AI video generation model, Veo 3, which has been lauded for its hyperrealistic videos. With its latest offering Baidu seems to be aiming to outpace giants like Google, OpenAI, and even Runway in this segment.
© IE Online Media Services Pvt Ltd
Expand