Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

Don’t Leave America! Microsoft, JP Morgan, Amazon, IBM, and Apple Caution H1B & H4 Techies

Lincoln Center’s Collider Fellows explore how tech could transform the performing arts

Assessing Valuation as Leadership Changes, Revenue Slide, and Lawsuits Shake Investor Confidence

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Business AI
    • Advanced AI News Features
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
DeepSeek

China’s DeepSeek Challenges US AI Costs with Low-Cost Training Model – Space/Science news

By Advanced AI EditorSeptember 20, 2025No Comments2 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


The disclosure appeared in a peer-reviewed article published Wednesday in Nature, marking the first time the Hangzhou-based company revealed details of its training costs.

DeepSeek’s release of lower-cost AI systems earlier this year unsettled global tech markets, with investors fearing the models could erode the position of US giants such as Nvidia.

The Nature article, co-authored by founder Liang Wenfeng, said the R1 was trained using 512 Nvidia H800 chips and took 80 hours to complete. A previous January version of the paper omitted cost details.

Training large-language models typically requires weeks of computation on powerful processors, often costing tens or even hundreds of millions of dollars. OpenAI chief executive Sam Altman said in 2023 that foundational model training had cost “much more” than $100 million, without providing specifics.

Washington has questioned DeepSeek’s claims. US officials told Reuters in June the company held “large volumes” of Nvidia’s high-end H100 chips despite American export bans. Nvidia said DeepSeek lawfully used H800 chips, while DeepSeek acknowledged for the first time that it also possessed A100 chips, employed in preliminary development stages.

DeepSeek’s access to advanced processors has helped it attract leading Chinese researchers, Reuters has previously reported.

The company also addressed allegations it had copied OpenAI’s models. US officials and industry figures suggested in January that DeepSeek “distilled” OpenAI’s technology into its own.

DeepSeek defended the practice, saying distillation improves performance and reduces costs, making AI more accessible. The method allows one AI to learn from another’s outputs, leveraging prior investment while cutting expenses.

The firm acknowledged using Meta’s open-source Llama for some versions of its models. It also noted that training data for its V3 model included web content containing OpenAI-generated answers, but said this was incidental rather than deliberate.

OpenAI did not respond to Reuters’ request for comment.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleGoogle DeepMind Releases MoR Architecture, Significantly Enhancing Inference Efficiency of Large Models_the_large_models
Next Article OpenAI leads private market surge as 7 startups reach $1.3 trillion
Advanced AI Editor
  • Website

Related Posts

In Other News: 600k Hit by Healthcare Breaches, Major ShinyHunters Hacks, DeepSeek’s Coding Bias

September 19, 2025

Huawei co-develops safety-focused DeepSeek model to block politically sensitive topics

September 19, 2025

China’s DeepSeek shook the tech world. Its developer just revealed the cost of training the AI model

September 19, 2025

Comments are closed.

Latest Posts

Acquavella Signs Harumi Klossowska de Rola, Daughter of Balthus

Heirs of Jewish Collector Urge Court to Reconsider Claim to Sunflowers

Art World Figures Remember Agnes Gund: ‘a Legend and Icon’

Bizarre Trump Bitcoin Statue Appears in Washington, D.C.

Latest Posts

Don’t Leave America! Microsoft, JP Morgan, Amazon, IBM, and Apple Caution H1B & H4 Techies

September 20, 2025

Lincoln Center’s Collider Fellows explore how tech could transform the performing arts

September 20, 2025

Assessing Valuation as Leadership Changes, Revenue Slide, and Lawsuits Shake Investor Confidence

September 20, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Don’t Leave America! Microsoft, JP Morgan, Amazon, IBM, and Apple Caution H1B & H4 Techies
  • Lincoln Center’s Collider Fellows explore how tech could transform the performing arts
  • Assessing Valuation as Leadership Changes, Revenue Slide, and Lawsuits Shake Investor Confidence
  • Wall Street eyeing one big trade after Fed rate cut: Commodities
  • A Closer Look at an MIT Study

Recent Comments

  1. cocaine-prague-614 on C3.ai Stock Dips Following Palantir Technologies Earnings: What’s Going On? – C3.ai (NYSE:AI)
  2. wetten dass gewinner heute on A Library of LLM Intrinsics for Retrieval-Augmented Generation
  3. Sites.Google.com on Accelerating Job Searches And Career Transitions
  4. fluffycrab3Nalay on AI as a Service: Top AIaaS Vendors for All Types of Businesses (2025)
  5. fluffycrab3Nalay on Apple’s Lack Of New AI Features At WWDC Is ‘Startling,’ Expert Says – Apple (NASDAQ:AAPL)

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.