Advanced AI News

Report: DeepSeek’s newest model delayed due to GPU export restrictions

By Advanced AI Editor | June 27, 2025

China’s top artificial intelligence company DeepSeek Ltd. has reportedly run into trouble developing its next-generation R2 reasoning model because it cannot get its hands on enough of Nvidia Corp.’s graphics processing units.

The Information cited two anonymous sources familiar with DeepSeek’s efforts as saying that the company has been working on the upcoming R2 model for several months, but that Chief Executive Liang Wenfeng is not yet satisfied with it. The company cannot improve the model’s capabilities with the limited number of GPUs at its disposal.

DeepSeek shot to fame earlier this year when it debuted its original reasoning model R1, which proved to be more than a match for the most advanced models developed by U.S. companies like OpenAI, Anthropic PBC and Meta Platforms Inc., despite being built at a fraction of the cost.

According to The Information, DeepSeek trained R1 on a cluster of 50,000 Hopper GPUs, which included around 10,000 H100s, 10,000 H800s, and around 30,000 of the lower-powered H20 GPUs that were purpose-built for the Chinese market.

Chinese companies have never been able to purchase the H100 legally, and sales of the H800 to China have been barred since late 2023. It’s thought that some of these GPUs were secretly supplied to DeepSeek by its investor High-Flyer Capital Management, while others were procured via shell companies that access public cloud infrastructure services. The H20 GPUs were obtained legally, but they have since become hard to come by due to new U.S. government restrictions that prohibit their export to China.

Part of the problem is that many of the H20 GPUs in China are already being used by DeepSeek’s customers. The Information says the R1 model has been widely adopted by Chinese companies and government agencies, and most of them run it on H20 GPUs in the cloud. So there’s no more capacity available for DeepSeek to train its latest model.

It’s said that the H20 GPU shortages are already causing problems with R1, limiting how it is used by Chinese firms. If the R2 model significantly improves on R1, it’s expected that the demand for the model will increase beyond what Chinese cloud infrastructure providers can handle, according to staff interviewed by The Information.

The H20 processor is comparable to the H100 GPU that Nvidia sells to Western companies, but its bandwidth and connectivity were throttled to meet earlier restrictions on the types of chips that could be exported to China. However, President Donald Trump’s administration decided that even this scaled-down chip is too powerful to be shipped to a geopolitical rival, and in April it imposed new restrictions that ban the H20’s export to China.

That decision has reportedly thrown a major spanner in the works of Chinese AI developers. While there are some domestic alternatives available, such as Huawei Technologies Co. Ltd.’s Ascend 910B chipset, these are even less powerful than the H20 and they lack support for Nvidia’s CUDA software stack – a programming architecture that’s used to optimize applications and AI models to run on Nvidia’s GPUs. That’s problematic because virtually all Chinese AI developers are thought to be using the CUDA software.

The Information says DeepSeek’s R1 and R2 models are also optimized for Nvidia’s chips, and its inability to access them could prove to be a major setback in its efforts to keep pace with its U.S. rivals.

Image: SiliconANGLE/Dreamina
