Advanced AI News
CBInsights AI

What DeepSeek’s model releases mean for the future of AI

By Advanced AI Editor | April 2, 2025 | 6 Mins Read


We dig into the implications of China-based DeepSeek’s rapid ascent on the AI infrastructure landscape, the private sector, and enterprise AI strategies.

China’s DeepSeek has upended assumptions about what it takes to develop powerful AI models. 

The AI company, which emerged from Liang Wenfeng’s hedge fund High-Flyer, released an open-source reasoning model (named R1) in January 2025 that rivals the performance of OpenAI’s o1 reasoning model.

DeepSeek says that, thanks to some clever techniques, it trained its base model with limited chips and about $5.6M in compute costs, a fraction of the $100M+ that US rivals have spent training similar models.

That efficiency has raised questions about the scale of US companies’ AI investments. Increased spend on AI infrastructure has driven big tech’s combined capex past $50B in recent quarters, while venture investors poured $76.3B into US AI startups in 2024 alone, per CB Insights data. 

Below, we dive into 5 key trends highlighted by DeepSeek’s rise: 

1. AI infrastructure costs come under scrutiny
2. VCs and private AI sector face recalibration
3. Amid funding gap in China, restrictions force innovative development
4. Open-source ecosystem gains steam, with China making strides
5. Enterprises rethink AI strategies for open models

1. AI infrastructure costs come under scrutiny   

In the US, genAI development has raced ahead thanks to billions of dollars in funding. 

Big tech companies have justified their spend on AI infrastructure based on the need for more hardware (like chips) and energy to train bigger and more performant models.  

DeepSeek’s reported ability to develop similarly powerful models much more efficiently (though questions remain about the true cost of its development) is upsetting these assumptions. 

It’s also sending shockwaves through the public markets. On Monday, January 27, 2025, Nvidia’s stock fell more than 15% as investors reacted to R1. Other stocks linked to the AI value chain, including infrastructure suppliers like Oracle and power producers like Constellation Energy, also saw steep declines.

But in the long run, as AI infrastructure and operational costs decline, major tech companies stand to benefit from the expanding market. The decreasing cost barriers will accelerate enterprise AI adoption, allowing cloud providers like Microsoft and Amazon to capture growing demand through more competitively priced AI services.

Increasing AI usage would also benefit companies at the inference layer. While Nvidia, AMD, and Intel dominate the market for AI inference processors (chips that run already-trained models), startups like d-Matrix and Groq are making progress, particularly in power efficiency. In July 2024, Groq raised a $640M Series D.

2. VCs and private AI sector face recalibration 

DeepSeek’s advances could undercut the vast sums of money that have gone to foundation model developers. OpenAI and Anthropic alone have raised over $30B in funding. 

Going forward, it may be harder for AI startups to justify raising huge funding rounds to support their infrastructure buildouts.

At the same time, US developers maintain an advantage in compute and data, and they will likely adopt DeepSeek’s architecture changes. This may widen their performance lead if they can combine their superior resources with new efficiency gains.

DeepSeek also appears to have relied on outputs from OpenAI models to train its models, which would violate OpenAI’s terms of service — DeepSeek’s chatbot even self-identifies as an OpenAI product. Without OpenAI’s releases (including its o1 reasoning model), DeepSeek’s R1 model likely wouldn’t have emerged.

3. Amid funding gap in China, restrictions force innovative development 

AI startup funding in China is a fraction of what US AI startups raise: China saw $5.2B in AI funding in 2024, or about 7% of the $76.3B raised in the US.
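As a quick check on the funding comparison, the short sketch below recomputes the ratio from the two 2024 figures quoted in this article. The variable and function names are illustrative only, and the figures are taken as given rather than independently verified.

```python
# Figures quoted in the article (CB Insights data, 2024), in billions of USD.
china_ai_funding_b = 5.2
us_ai_funding_b = 76.3


def share_of(part: float, whole: float) -> float:
    """Return `part` as a percentage of `whole`."""
    return 100.0 * part / whole


# China's funding expressed as a percentage of US funding.
china_share_pct = share_of(china_ai_funding_b, us_ai_funding_b)

# The same gap expressed as a multiple (US funding divided by China's).
us_multiple = us_ai_funding_b / china_ai_funding_b

print(f"China's AI funding as a share of US funding: {china_share_pct:.1f}%")
print(f"US funding as a multiple of China's: {us_multiple:.1f}x")
```

The share comes out a little under 7%, which matches the rounded figure in the text, and the same gap can be read as US startups raising roughly 14 to 15 times as much.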

Funding has also cooled over time as the Chinese government has cracked down on its private sector in recent years.

The US has also put caps on exports of advanced AI chips, like those from Nvidia, which are key to model development. 

The crunch has forced companies like DeepSeek to get creative. (Note: It’s estimated that High-Flyer acquired anywhere from 10K to 50K Nvidia A100 chips before the export restrictions took effect, which helped launch DeepSeek.)

Rather than using more powerful hardware (like H100 chips), DeepSeek focused on making extremely efficient use of more constrained hardware through careful optimization at multiple levels — from model architecture to low-level GPU programming — in the development of its 671B parameter V3 model. This demonstrated that superior results could be achieved through better engineering rather than just using more powerful hardware. 

Meanwhile, its R1 model (which uses V3 as the base model) showcased how reinforcement learning without extensive labeled data could achieve high-quality reasoning capabilities, challenging the notion that expensive training with human feedback was necessary for strong performance.

Other companies to watch in China include the “AI tigers” Moonshot AI, Zhipu AI, Baichuan AI, MiniMax, and 01.AI — all of which have nabbed $1B+ valuations backed by China’s big tech companies.

It will be worth keeping an eye on China’s AI ecosystem as well for the eventual AI applications that take root — especially those that become “killer” consumer apps. 

For example, MiniMax has released AI apps in the US featuring avatar chatbots and video generation tools (though its app Talkie was removed from Apple’s App Store in December 2024). In another indication that Chinese AI startups are pushing into the US consumer market, DeepSeek’s mobile app is currently the top free app in the US App Store on iOS.

4. Open-source ecosystem gains steam, with China making strides

DeepSeek’s advances also give more fuel to the open-source movement — highlighting that open, frontier models can be developed with more modest resources and computing infrastructure. 

We previously dug into how the assumption of increasing model training costs was tipping the model race in favor of closed-source developers. Since 2020, closed-source AI model developers have secured $37.5B in venture funding, while open-source developers have trailed with $14.9B. However, DeepSeek’s efficient training approach challenges the assumption that massive funding is necessary for competitive model development.

[Figure: Open-source vs. closed-source model developers tearsheet]

Overall, in the last few months, China has been closing the gap in model performance vs. US rivals, especially among open-source models. Based on our analysis from earlier this year, Alibaba’s open-source Qwen-2.5 had made it onto the leaderboard, alongside primarily closed-source models from US developers. Today, when including reasoning models, DeepSeek’s R1 takes the top spot, followed by OpenAI’s o1-mini.

If China continues to contribute top-ranking open-source models, that could encourage US developers to build on top of its technology — increasing China’s importance in the AI development landscape.
