Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

Tencent unveils new AI model ‘Hunyuan T1’ that rivals DeepSeek R1 in performance and price

Perplexity Comet Vs Google Chrome — Should You Switch To An AI Browser?

When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs – Takara TLDR

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Business AI
    • Advanced AI News Features
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
DeepSeek

DeepSeek R2 reasoning AI is coming soon, and it could make waves again

By Advanced AI EditorApril 28, 2025No Comments4 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


A few months ago, DeepSeek stunned the world, crashing the US stock market in the process. The Chinese AI company released DeepSeek R1, a reasoning model that was just as powerful as ChatGPT o1 despite costing practically nothing by comparison to create and train. With limited access to powerful processors, DeepSeek came up with software optimizations for its DeepSeek AI models and used less powerful NVIDIA GPUs to train its AI. Also, DeepSeek released DeepSeek AI models as open source, which means anyone could install them for free on their computers and run them without connecting to the internet.

That explains why DeepSeek tanked the market. Suddenly, it appeared that access to high-end hardware wasn’t the moat that would protect the advancements of US AI firms. China could suddenly compete, too.

It wasn’t without controversy, as OpenAI accused DeepSeek of training its AI with the help of data from ChatGPT. Also, the DeepSeek AI apps posed security and privacy worries, as all user data from its mobile apps is sent to China. That’s why installing DeepSeek on a computer rather than using a dedicated app is a better option for users.

A few months later, rumors are swirling that DeepSeek is on the verge of releasing DeepSeek R2, its next-gen reasoning model that should compete against OpenAI’s recently released o3 and o4-mini. Will DeepSeek tank the stock market again? That’s difficult to predict, though it’s safe to say Trump’s tariffs have done far more damage than DeepSeek ever could to the US stock market.

Tech. Entertainment. Science. Your inbox.

Sign up for the most interesting tech & entertainment news out there.

By signing up, I agree to the Terms of Use and have reviewed the Privacy Notice.

Once the panic subsided, it was clear that the worries about hardware no longer mattering as much for frontier AI development were misplaced. Yes, software innovations are always possible, but that won’t stop the likes of Nvidia from making next-gen AI chips or US AI firms from buying them.

The market is still hurting, so DeepSeek R2 can’t possibly deal a blow to the US economy like its predecessor did. The world already knows the Chinese AI startup is using a different strategy than OpenAI because of the limitations it’s facing. The world also probably expects DeepSeek R2 to be more efficient than rivals.

What the world doesn’t necessarily expect is DeepSeek training R2 on chips coming from Huawei rather than Nvidia. That’s what rumors say right now, and that’s the kind of shock AI chip makers like Nvidia might feel.

Also, reports detailing DeepSeek R2 say the company has developed a local supply chain to meet its AI hardware needs, which would reduce reliance on external partners and ensure rapid infrastructure development.

The word on the street is that DeepSeek R2 is a massive 1.2 trillion-parameter model. However, the reasoning AI will use only 78 billion parameters per token thanks to its hybrid MoE (Mixture-of-Experts) architecture.

This should improve costs, and rumors say that DeepSeek R2 is 97.3% cheaper to train than GPT-4. Inference costs also dropped by the same percentage. Rumors say DeepSeek R2 will cost about $0.07 per million input tokens and $0.27 per million output tokens.

Rumors also say that DeepSeek R2 scores high in benchmarks. DeepSeek reportedly trained the new reasoning model on 5.2 petabytes of high-end data (including finance, law, and patents). DeepSeek used Huawei Ascend 910B chips to train R2.

DeepSeek R2 should show strong reasoning capabilities, including multimodal support with high-end vision abilities.

That’s what the rumors say, at least. There’s no telling when DeepSeek R2 might be released, though all signs point to early May or the weeks thereafter.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleAlibaba Steps Up AI Game With Qwen 3, Challenging DeepSeek’s Low-Cost AI Success
Next Article Le Chat, the cat-bot France has pinned its AI hopes on
Advanced AI Editor
  • Website

Related Posts

When You Tell AI Models to Act Like Women, Most Become More Risk-Averse: Study

October 12, 2025

Ant Group Launches Ling-1T: China’s Trillion-Parameter AI Model to Rival OpenAI and DeepSeek

October 10, 2025

New York-Based Reflection AI Raises $2B, Hits $8B Valuation

October 9, 2025
Leave A Reply

Latest Posts

Smithsonian Closes Museums Amid Government Shutdown

The Rubin Names 2025 Art Prize, Research and Art Projects Grants

Kochi-Muziris Biennial Announces 66 Artists for December Exhibition

Instagram Launches ‘Rings’ Awards for Creators—With KAWS as a Judge

Latest Posts

Tencent unveils new AI model ‘Hunyuan T1’ that rivals DeepSeek R1 in performance and price

October 12, 2025

Perplexity Comet Vs Google Chrome — Should You Switch To An AI Browser?

October 12, 2025

When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs – Takara TLDR

October 12, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Tencent unveils new AI model ‘Hunyuan T1’ that rivals DeepSeek R1 in performance and price
  • Perplexity Comet Vs Google Chrome — Should You Switch To An AI Browser?
  • When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs – Takara TLDR
  • Anthropic’s ‘anti-China’ stance triggers exit of star AI researcher
  • The Future of Private Capital Markets: How PitchBook Is Shaping Global Investing

Recent Comments

  1. ChillgerN4Nalay on An improved Large-scale 3D Vision Dataset for Compositional Recognition
  2. ChillgerN4Nalay on Reverse Engineering The IBM PC110, One PCB At A Time
  3. ChillgerN4Nalay on OpenAI expects subscription revenue to nearly double to $10bn
  4. EchoVortexE3Nalay on Study: AI-Powered Research Prowess Now Outstrips Human Experts, Raising Bioweapon Risks
  5. وی ناترند ۱ کیلویی on Meta, Booz Allen Launch ‘Space Llama’ AI System For Space Station Operations – Meta Platforms (NASDAQ:META), Booz Allen Hamilton (NYSE:BAH)

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.