Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

Aurora Mobile’s GPTBots.ai to Integrate Zhipu AI’s Flagship GLM-4.5 Model to Enhance AI Capabilities

Google unveils Gemma, a new family of open AI models for devs

IBM: Shadow AI breaches cost $670K more, 97% of firms lack controls

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Industry AI
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
Video Generation

Alibaba Releases Wan2.2 AI Video Generation Models, Combining Mixture-of-Experts and Video Diffusion

By Advanced AI EditorJuly 29, 2025No Comments5 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Chinese tech giant Alibaba has released Wan2.2, a major open-source update to its AI video generation models. Unveiled on July 28, the new series directly challenges paid rivals like OpenAI’s Sora and Google’s Veo. It introduces an advanced Mixture-of-Experts (MoE) architecture to improve video quality.

The release also includes a highly efficient 5B model that generates 720p video on consumer-grade GPUs. This move is part of Alibaba’s strategy to lead the open-source AI space by giving powerful, free tools to developers and researchers. It succeeds the company’s Wan2.1 models, which were released earlier this year.

Under the Hood: MoE Architecture and Consumer-Grade HD Video

Wan2.2’s core innovation is the introduction of a Mixture-of-Experts (MoE) architecture into its video diffusion model, a first for the field. This advanced design, widely validated in large language models, allows for a massive increase in the model’s total capacity without a corresponding rise in computational cost during inference. The architecture is specifically tailored to the video generation process, separating the complex denoising task into specialized functions.

The MoE system employs a two-expert design. A “high-noise” expert handles the early stages of generation, focusing on establishing the overall layout and motion of the video. As the process continues, a “low-noise” expert takes over to refine intricate details and enhance visual quality.

According to the project’s technical documentation, this approach increases the model’s total parameter count to 27 billion, but with only 14 billion parameters active at any given step, it maintains the computational footprint of a much smaller model.

To complement this new architecture, Wan2.2 was trained on a significantly expanded and refined dataset, featuring 65.6% more images and 83.2% more videos than its predecessor, Wan2.1. The team placed a heavy emphasis on creating “cinematic-level aesthetics” by using meticulously curated data with detailed labels for lighting, composition, contrast, and color tone.

This allows for more precise and controllable generation, enabling users to create videos with customizable aesthetic preferences, as detailed in the official announcement.

Wan2.2 compared to leading closed-source commercial models on ALibaba's proprietory Wan-Bench 2.0.
Wan2.2 compared to leading closed-source commercial models on ALibaba’s proprietory Wan-Bench 2.0.

Perhaps the most significant part of the release for accessibility is the new TI2V-5B model, a compact 5-billion-parameter version designed for efficient deployment. This hybrid model natively supports both text-to-video and image-to-video tasks within a single unified framework. Its efficiency is driven by a new high-compression VAE (Variational Autoencoder) that achieves a remarkable compression ratio, making high-definition video generation feasible on non-enterprise hardware.

This breakthrough allows the TI2V-5B model to generate 720p video at 24fps on consumer-grade GPUs like the NVIDIA RTX 4090, requiring less than 24GB of VRAM. This brings advanced AI video tools to a much broader audience of developers, researchers, and creators. To accelerate this adoption, the Wan2.2 models have already been integrated into popular community tools, including ComfyUI and Hugging Face Diffusers.

Alibaba’s decision to release Wan2.2 under a permissive Apache 2.0 license is a direct strategic challenge to the closed, proprietary models that dominate the high end of the market. Companies like OpenAI and Google have kept their most advanced video models, Sora and Veo, behind paywalls and APIs.

By providing a powerful, free alternative, Alibaba is escalating the competition and betting that an open ecosystem will foster faster innovation and wider adoption. This strategy mirrors the disruption seen in AI image generation, where open-source models have become formidable competitors to closed systems.

Part of a Broader AI Ecosystem Offensive

The Wan2.2 launch is not an isolated event. It is the latest move in a rapid-fire series of major AI releases from Alibaba, signaling a comprehensive offensive to establish itself as a leader across multiple AI domains. This flurry of activity demonstrates a clear strategy to build a full suite of open tools for developers.

In the week prior, the company unveiled its new flagship reasoning model, Qwen3-Thinking-2507, which topped key industry benchmarks. It also launched a powerful agentic coding model, Qwen3-Coder, for automating software development tasks.

This strategic pivot was underscored by a statement from Alibaba Cloud, which explained its decision to abandon the “hybrid thinking” mode of earlier models. A spokesperson said, “after discussing with the community and reflecting on the matter, we have decided to abandon the hybrid thinking mode. We will now train the Instruct and Thinking models separately to achieve the best possible quality.”

To showcase the real-world application of its AI, Alibaba also previewed its new “Quark AI” smart glasses. The wearables are powered by the Qwen3 series, a move designed to build market confidence by connecting its software prowess to a tangible consumer product.

Song Gang of Alibaba’s Intelligent Information business group shared his vision for the technology, stating, “ai glasses will become the most important form of wearable intelligence – it will serve as another pair of eyes and ears for humans.”

A Timely Launch Amid Benchmark Skepticism

However, this aggressive push comes at a time of growing industry skepticism about the reliability of AI benchmarks. Just days before the recent Qwen releases, a study alleged that Alibaba’s older Qwen2.5 model had “cheated” on a key math test by memorizing answers from contaminated training data.

The controversy highlights a systemic issue of “teaching to the test” in the race for leaderboard dominance. As AI strategist Nate Jones noted, “the moment we set leaderboard dominance as the goal, we risk creating models that excel in trivial exercises and flounder when facing reality.” This sentiment is echoed by experts like Sara Hooker, Head of Cohere Labs, who argued that “when a leaderboard is important to a whole ecosystem, the incentives are aligned for it to be gamed.”

While Alibaba has moved on to newer models like Qwen3, the allegations cast a shadow over the “benchmark wars” that define AI competition. The Wan2.2 release, with its focus on tangible capabilities and accessibility, may be an attempt to shift the narrative from leaderboard scores to real-world utility and open innovation.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous Article20 national security experts urge Trump administration to restrict Nvidia H20 sales to China
Next Article IBM Partners with Datavault AI to Launch Enterprise Data Monetization Platform
Advanced AI Editor
  • Website

Related Posts

SoulGen Debuts AI-Powered Image-to-Video Platform, Ushering

July 30, 2025

Google unveils Veo 3 Fast, an upgraded AI model for faster video generation | Technology News

July 30, 2025

Google’s Veo 3 and Veo 3 Fast Video Generation Models Now Generally Available to Developers via Vertex AI

July 30, 2025

Comments are closed.

Latest Posts

Trump’s ‘Big Beautiful Bill’ Orders Museum to Relocate Space Shuttle

Thomas Kinkade Foundation Denounces DHS’s Usage of Painting

Three Convicted for Stealing Ancient Celtic Coins from German Museum

Millennial and Gen Z Gallerists Looking to ‘Redefine Success’ and more

Latest Posts

Aurora Mobile’s GPTBots.ai to Integrate Zhipu AI’s Flagship GLM-4.5 Model to Enhance AI Capabilities

July 30, 2025

Google unveils Gemma, a new family of open AI models for devs

July 30, 2025

IBM: Shadow AI breaches cost $670K more, 97% of firms lack controls

July 30, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Aurora Mobile’s GPTBots.ai to Integrate Zhipu AI’s Flagship GLM-4.5 Model to Enhance AI Capabilities
  • Google unveils Gemma, a new family of open AI models for devs
  • IBM: Shadow AI breaches cost $670K more, 97% of firms lack controls
  • How an MIT program planted the roots for Open Range and continues to shape Omaha’s startup ecosystem
  • Meta to spend up to $72B on AI infrastructure in 2025 as compute arms race escalates

Recent Comments

  1. 🔏 Security - Transfer 1.8 BTC incomplete. Fix here >> https://graph.org/OBTAIN-CRYPTO-07-23?hs=85ce984e332839165eff00f10a4fc17a& 🔏 on The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies (Paper Explained)
  2. 💾 System: Transfer 0.5 Bitcoin incomplete. Verify now >> https://graph.org/OBTAIN-CRYPTO-07-23?hs=e1378433e58a7b696e3632102c97ef63& 💾 on Qwen 2.5 Coder and Qwen 3 Lead in Open Source LLM Over DeepSeek and Meta
  3. 📞 Security; Transaction 0.5 BTC failed. Verify now => https://graph.org/OBTAIN-CRYPTO-07-23?hs=ec8b72524f993be230f3c8fd50d7bbae& 📞 on OpenAI Five: Dota Gameplay
  4. 📨 System: Transfer 0.5 Bitcoin on hold. Verify now => https://graph.org/OBTAIN-CRYPTO-07-23?hs=b25dab3fe579278f363cd6d123369e86& 📨 on New ChatGPT voice mode updates ⬇️
  5. 🖊 System; Deposit 0.3 Bitcoin failed. Authorize here => https://graph.org/OBTAIN-CRYPTO-07-23?hs=e9fac00a4f303105cc60c701c8ee35b9& 🖊 on Meta, Booz Allen develop ‘Space Llama’ AI system for the International Space Station

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.