Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

HPE Expands NVIDIA AI Enterprise Integration with Blackwell GPU Solutions

Elon Musk cries antitrust as X & Grok can’t compete with OpenAI

IBM relocates thousands of employees to One Madison Ave

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Business AI
    • Advanced AI News Features
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
DeepSeek

DeepSeek R2 reasoning AI is coming soon, and it could make waves again

By Advanced AI EditorApril 28, 2025No Comments4 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


A few months ago, DeepSeek stunned the world, crashing the US stock market in the process. The Chinese AI company released DeepSeek R1, a reasoning model that was just as powerful as ChatGPT o1 despite costing practically nothing by comparison to create and train. With limited access to powerful processors, DeepSeek came up with software optimizations for its DeepSeek AI models and used less powerful NVIDIA GPUs to train its AI. Also, DeepSeek released DeepSeek AI models as open source, which means anyone could install them for free on their computers and run them without connecting to the internet.

That explains why DeepSeek tanked the market. Suddenly, it appeared that access to high-end hardware wasn’t the moat that would protect the advancements of US AI firms. China could suddenly compete, too.

It wasn’t without controversy, as OpenAI accused DeepSeek of training its AI with the help of data from ChatGPT. Also, the DeepSeek AI apps posed security and privacy worries, as all user data from its mobile apps is sent to China. That’s why installing DeepSeek on a computer rather than using a dedicated app is a better option for users.

A few months later, rumors are swirling that DeepSeek is on the verge of releasing DeepSeek R2, its next-gen reasoning model that should compete against OpenAI’s recently released o3 and o4-mini. Will DeepSeek tank the stock market again? That’s difficult to predict, though it’s safe to say Trump’s tariffs have done far more damage than DeepSeek ever could to the US stock market.

Tech. Entertainment. Science. Your inbox.

Sign up for the most interesting tech & entertainment news out there.

By signing up, I agree to the Terms of Use and have reviewed the Privacy Notice.

Once the panic subsided, it was clear that the worries about hardware no longer mattering as much for frontier AI development were misplaced. Yes, software innovations are always possible, but that won’t stop the likes of Nvidia from making next-gen AI chips or US AI firms from buying them.

The market is still hurting, so DeepSeek R2 can’t possibly deal a blow to the US economy like its predecessor did. The world already knows the Chinese AI startup is using a different strategy than OpenAI because of the limitations it’s facing. The world also probably expects DeepSeek R2 to be more efficient than rivals.

What the world doesn’t necessarily expect is DeepSeek training R2 on chips coming from Huawei rather than Nvidia. That’s what rumors say right now, and that’s the kind of shock AI chip makers like Nvidia might feel.

Also, reports detailing DeepSeek R2 say the company has developed a local supply chain to meet its AI hardware needs, which would reduce reliance on external partners and ensure rapid infrastructure development.

The word on the street is that DeepSeek R2 is a massive 1.2 trillion-parameter model. However, the reasoning AI will use only 78 billion parameters per token thanks to its hybrid MoE (Mixture-of-Experts) architecture.

This should improve costs, and rumors say that DeepSeek R2 is 97.3% cheaper to train than GPT-4. Inference costs also dropped by the same percentage. Rumors say DeepSeek R2 will cost about $0.07 per million input tokens and $0.27 per million output tokens.

Rumors also say that DeepSeek R2 scores high in benchmarks. DeepSeek reportedly trained the new reasoning model on 5.2 petabytes of high-end data (including finance, law, and patents). DeepSeek used Huawei Ascend 910B chips to train R2.

DeepSeek R2 should show strong reasoning capabilities, including multimodal support with high-end vision abilities.

That’s what the rumors say, at least. There’s no telling when DeepSeek R2 might be released, though all signs point to early May or the weeks thereafter.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleAlibaba Steps Up AI Game With Qwen 3, Challenging DeepSeek’s Low-Cost AI Success
Next Article Le Chat, the cat-bot France has pinned its AI hopes on
Advanced AI Editor
  • Website

Related Posts

What is DeepSeek? All about China’s latest AI model

August 11, 2025

Zetrix Develops Shariah-Compliant NurAI LLM With DeepSeek

August 11, 2025

China’s infrastructure enters ‘DeepSeek moment’

August 11, 2025
Leave A Reply

Latest Posts

Midjourney Slams Lawsuit Filed by Disney to Prevent AI Training

Smithsonian Updates Museum Display on Impeachment To Include Trump

Funder Tried to Hijack Kandinsky Art Theft Suits, Says Collector

How to Stylize Your Images with Flux Kontext in ComfyUI

Latest Posts

HPE Expands NVIDIA AI Enterprise Integration with Blackwell GPU Solutions

August 12, 2025

Elon Musk cries antitrust as X & Grok can’t compete with OpenAI

August 12, 2025

IBM relocates thousands of employees to One Madison Ave

August 12, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • HPE Expands NVIDIA AI Enterprise Integration with Blackwell GPU Solutions
  • Elon Musk cries antitrust as X & Grok can’t compete with OpenAI
  • IBM relocates thousands of employees to One Madison Ave
  • Creating uniquely human digital banking experiences at TD
  • C3 AI Stock Plunges After ‘Completely Unacceptable’ Q1 Sales – C3.ai (NYSE:AI)

Recent Comments

  1. EdwardEnror on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  2. ThomasWep on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  3. ThomasWep on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  4. EdwardEnror on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  5. ThomasWep on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.