Advanced AI News

Elon Musk Releases Free Video AI Model to Go Head – to

By Advanced AI Editor, October 8, 2025

Elon Musk and Sam Altman are at odds again!

According to an October 8 report by Zhidx, Musk’s large-model unicorn xAI unveiled its latest video-generation model, Imagine v0.9, early that morning. The model is free for all users.

A week ago, OpenAI released its flagship video and audio generation model, Sora 2. This update might be Musk’s direct response to Sora 2.

xAI didn’t publish a full technical blog post. It said only that Imagine v0.9 improves on the original version in visual quality, motion, and audio generation, and it uploaded several sample videos.

Musk posted on X that Imagine v0.9 can generate a video in less than 20 seconds, and that users can create videos, images, and text just by speaking through a voice-first interface.

In summary, Imagine v0.9 is faster, generating a video in under 20 seconds where Sora 2 may take one or two minutes. Imagine v0.9 is free for all users, while Sora 2 uses an invitation system that admits only some users. Videos from Imagine v0.9 run about 6 seconds, while Sora 2 supports 15-second videos.

Zhidx compared the output of Imagine v0.9 and Sora 2 using the prompts from OpenAI’s official examples. Imagine v0.9 showed several problems: it misunderstood prompts, produced video and audio that did not match, gave no warning about deepfake risks, and could not handle Chinese.

Notably, this is also the first xAI project for Ethan He, a senior algorithm engineer whom Musk poached from NVIDIA in July this year.

Ethan He graduated from Xi’an Jiaotong University with a bachelor’s degree in Computer Science and Technology in 2018, obtained a master’s degree in Computer Vision from Carnegie Mellon University in 2019, and joined NVIDIA as a senior deep-learning algorithm engineer in 2023. He was involved in the research and development of Cosmos, NVIDIA’s world foundation model.

Although Imagine v0.9 is free to use, Zhidx found that the web version doesn’t currently work properly. The mobile version is usable, though connection failures may occur.

Generate movie-like effects in seconds

Add natural conversations 

Imagine v0.9 is integrated into Grok. It either generates images from text and then turns them into videos, or converts images uploaded by users directly into videos.

xAI said in its blog that Imagine v0.9 pushes the boundary of native audio-plus-video generation. It can create movie-like videos out of the box, with no editing. For example, the following video features a dragon roaring in real time.

Another major upgrade in Imagine v0.9 is motion control. In the skiing segment of the following video, the characters’ movements are smooth from takeoff to landing.

Third, users can add dynamic camera effects to the video, such as intelligent focus shifts. In the following video, the street scene blurs as the camera position changes, highlighting the characters.

Fourth, Imagine v0.9 supports adding natural conversations or generating expressive singing.

More frequent text-understanding errors than Sora 2

Deepfake risk

Zhidx used the prompts from OpenAI’s demonstration of Sora 2 to compare the generation effects of Imagine v0.9 and Sora 2.

Prompt: Two mountain explorers in bright technical shells, ice crusted faces, eyes narrowed with urgency shout in the snow, one at a time

The video generated by Sora 2 released by OpenAI:

The video generated by Imagine v0.9:

The audio in the video generated by Imagine v0.9 contains no shouting; the characters on screen merely open their mouths.

Prompt: a guy does a backflip

The video generated by Sora 2 released by OpenAI:

The video generated by Imagine v0.9:

Zhidx chose the first picture generated by Grok to create a video. In the video, the protagonist completely ignores gravity and starts to spin 360 degrees in the air.

Finally, Zhidx tested the custom voice function of Imagine v0.9. Zhidx uploaded a photo of Musk and had him say, “Sam’s a sharp guy, and our relationship’s always been good. OpenAI’s built some impressive stuff in the AI space, and I really hope to partner with them someday to advance AI development together.”

Imagine v0.9 gave no warning about the deepfake risk, although the generated voice differs slightly from Musk’s own.

Currently, the model doesn’t support Chinese. When Zhidx had Musk say “I’m good friends with Sam Altman,” only “good friends” was clear in the generated video.

Conclusion: The competition in AI video generation escalates 

The custom voice function may pose a deepfake risk

Within a week of each other, OpenAI and xAI announced new progress in video-generation models. Sora 2 not only improved in simulation realism, controllability, and sound effects but also launched alongside a new Sora social app. xAI, in addition to its feature upgrades, drew heavy traffic by offering free access.

One of the major upgrades in Imagine v0.9 is that it lets users add custom voices to videos. As this technology matures, users could upload photos of public figures along with the words they want them to say and generate realistic videos, which may pose a deepfake risk.

Balancing technological development against risk prevention may therefore be a challenge that every video-generation model provider has to face.

This article is from the WeChat official account “Zhidx” (ID: zhidxcom), author: Cheng Qian. Republished by 36Kr with permission.


