Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions – Takara TLDR

Elon Musk’s AI War With OpenAI Explained As Rift Intensifies, Lands In Court

OpenAI is the world’s most valuable private company after private stock sale

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Business AI
    • Advanced AI News Features
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
DeepSeek

U.S. Commerce Sec. Lutnick says American AI dominates DeepSeek, thanks Trump for AI Action Plan — OpenAI and Anthropic beat Chinese models across 19 different benchmarks

By Advanced AI EditorOctober 3, 2025No Comments3 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


When you buy through links on our articles, Future and its syndication partners may earn a commission.

 Deepseek logo on an iPhone.

Credit: Getty / Herstockart

The National Institute of Science and Technology (NIST) has just completed a comprehensive test of Chinese and American AI models, with the results showing that models from OpenAI and Anthropic outperformed DeepSeek across 19 different benchmarks. U.S. Commerce Secretary Howard Lutnick shared the results on X, thanking President Donald Trump for his AI Action Plan to accelerate American AI innovation and infrastructure while encouraging its allies and friendly nations to adopt it.

“The report is clear: DeepSeek lags far behind, especially in cyber and software engineering. These weaknesses aren’t just technical. They demonstrate why relying on foreign AI is dangerous and shortsighted,” Sec. Lutnick said in his post. “Allowing our adversaries to control AI poses serious risks to our security. By setting the standards, driving innovation, and keeping America secure, the Department of Commerce is helping ensure continued U.S. leadership in AI.”

NIST is a federal agency under the Commerce Department that develops standards and supports industry to help keep the U.S. industrially competitive globally, and it conducted this study under the newly-established Center for AI Standards and Innovation (CAISI).

The tests pitted the R1, R1-0528, and V3.1 DeepSeek models (crucially not DeepSeek’s new V3.2 released this week) against OpenAI’s GPT-5, GPT-5-mini, and GPT-oss, and Anthropic’s Opus 4, using 19 different benchmarks. These publicly available tests include SWE-bench Verified and Breakpoint for software engineering, MMLU-Pro and GPQA for general knowledge capabilities, SMT 2025, PUMaC 2024, and OTIS-AIME 2025 math contests for mathematical reasoning, and the AgentDojo framework for hijacking attack resilience. Aside from this, the institution also customized and developed its own custom assessments to test for things like CCP censorship, as there’s no standard test for that.

All the results were outlined in a 69-page document [PDF], with CAISI saying that OpenAI and Anthropic outperform DeepSeek in all tests, but most especially in software engineering and cyber tasks. The U.S. AI models generally outperform DeepSeek by 20 to 80%, and cost around 35% less to operate. The latter is also easier to hijack and jailbreak, making it more susceptible to acting unintentionally. The report also said that Chinese models are biased and that they toe the line when it comes to messaging from Beijing, although it’s worth bearing in mind that other AI benchmarking tools exist that might yield different results.

Despite all this, DeepSeek R1 is continuously being adopted, with CAISI saying that the “use of these models may pose a risk to application developers, to consumers, and to U.S. national security.” Beyond that, the Chinese AI company is continuously releasing new models, with DeepSeek-V3.2-Exp being released earlier this week, possibly rendering some of these tests moot.

Follow Tom’s Hardware on Google News to get our up-to-date news, analysis, and reviews in your feeds. Make sure to click the Follow button.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleSam Altman Has Become the Star of OpenAI’s Sora App
Next Article Optimal Control Meets Flow Matching: A Principled Route to Multi-Subject Fidelity – Takara TLDR
Advanced AI Editor
  • Website

Related Posts

DeepSeek Launches New AI Model to Undercut OpenAI With 50% Cheaper API

October 2, 2025

DeepSeek Launches New AI Model, Cuts API Costs by 50%

October 2, 2025

DeepSeek tests “sparse attention” to slash AI processing costs

September 30, 2025

Comments are closed.

Latest Posts

Italian police seize 21 suspected forgeries attributed to Dalí

Acclaimed Sculptor Petrit Halilaj Wins $100,000 Nasher Prize

Syracuse University Starts First Program For Podcasters and Influencers

Sotheby’s Sells York Avenue HQ to Weill Cornell, Prepares Breuer Move

Latest Posts

StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions – Takara TLDR

October 3, 2025

Elon Musk’s AI War With OpenAI Explained As Rift Intensifies, Lands In Court

October 3, 2025

OpenAI is the world’s most valuable private company after private stock sale

October 3, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • StealthAttack: Robust 3D Gaussian Splatting Poisoning via Density-Guided Illusions – Takara TLDR
  • Elon Musk’s AI War With OpenAI Explained As Rift Intensifies, Lands In Court
  • OpenAI is the world’s most valuable private company after private stock sale
  • Massive fire breaks out at Chevron oil refinery in California
  • Comet AI browser goes free, challenging Google Chrome’s monopoly

Recent Comments

  1. Refugia Stasko on Baidu AI drive to boost jobs
  2. John Vea on Class Dismissed? Representative Claims in Getty v. Stability AI | Cooley LLP
  3. Mercedez Kramarczyk on Best Buy wants AI to offer customers fewer — but more relevant — search results
  4. Jordan Ellworths on Half of companies planning to replace customer service with AI are reversing course
  5. Raelene Jacobsma on Recent AI Funding Flows Into Four ‘F’s: Food, Fitness, Fashion, Finance

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.