Advanced AI News

DeepSeek V3.1 pushes open-source AI forward with smarter context and reasoning

By Advanced AI Editor | August 20, 2025 | 3 Mins Read


DeepSeek has announced V3.1, an upgrade to its large language model. The release took place on 19 August 2025 through the company’s official WeChat group. Though the announcement was low-key, the AI community has been quick to react. Developers and researchers are calling it a step forward for open-source models.

The most notable improvement is the expanded context length. V3.1 can now handle 128,000 tokens in a single query. This matches the context length of the open-source version and allows the model to manage long conversations, technical documents, and retrieval-based tasks with better accuracy. Enterprises see this as a strong feature for data-heavy workflows.
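To give a sense of what a 128,000-token window means in practice, here is a minimal sketch that checks whether a set of documents would plausibly fit in one query. The four-characters-per-token ratio and the reply reserve are assumptions for illustration only, not DeepSeek's tokenizer or any official limit, and the function names are hypothetical.

```python
CONTEXT_WINDOW_TOKENS = 128_000  # context length quoted in the article
CHARS_PER_TOKEN = 4              # ASSUMPTION: crude heuristic, not DeepSeek's tokenizer


def estimate_tokens(text: str) -> int:
    """Rough token estimate: character count divided by CHARS_PER_TOKEN, rounded up."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(documents: list[str], reserved_for_reply: int = 4_000) -> bool:
    """Return True if the documents fit in the window, leaving room for a reply.

    reserved_for_reply is a hypothetical safety margin, not a documented value.
    """
    budget = CONTEXT_WINDOW_TOKENS - reserved_for_reply
    used = sum(estimate_tokens(doc) for doc in documents)
    return used <= budget


# Two placeholder documents of 200,000 and 250,000 characters.
docs = ["x" * 200_000, "y" * 250_000]
print(fits_in_context(docs))
```

For real workloads, counting tokens with the model's own tokenizer would be more reliable than a character-based estimate.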

V3.1 also raises the parameter count to 685 billion, compared to 671 billion in V3. Despite the increase, costs remain under control thanks to its Mixture-of-Experts design. Only 37 billion parameters are active per token, which reduces the expense of running the model. This efficiency makes it competitive with closed systems such as GPT-4o and Claude 3.5 Sonnet.
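As a rough illustration of why the Mixture-of-Experts design keeps running costs down, the figures above imply that only about 5.4 per cent of the weights are used for any one token. The sketch below is simple arithmetic on the numbers quoted in this article; the function name is illustrative.

```python
def active_fraction(active_params_b: float, total_params_b: float) -> float:
    """Share of parameters used per token in a Mixture-of-Experts model."""
    if total_params_b <= 0:
        raise ValueError("total_params_b must be positive")
    return active_params_b / total_params_b


# Figures quoted in the article, in billions of parameters.
TOTAL_PARAMS_B = 685
ACTIVE_PER_TOKEN_B = 37

share = active_fraction(ACTIVE_PER_TOKEN_B, TOTAL_PARAMS_B)
print(f"Active per token: {share:.1%} of all parameters")
```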

Stronger in coding, logic and math

Community testing suggests that V3.1 is better at problem-solving, completing tasks that involve complex rules and logical reasoning. Developers also report stronger results in coding, especially in Python and Bash. Reported benchmark accuracy now stands close to 60 per cent, higher than before.

Mathematics is another clear improvement. The model builds on V3, which outperformed rivals like Qwen2.5 72B on tests such as AIME and MATH-500. These results point to V3.1’s value for users who work on scientific or analytical projects.

Open-source release and future outlook

DeepSeek has continued its open approach by releasing V3.1 under the MIT Licence. Developers can access the model on Hugging Face in Safetensors format. While major inference providers have yet to add support, the release is already in use within open-source communities.

Training costs were only 5.6 million US dollars, achieved with 2.788 million H800 GPU hours. In comparison, proprietary models often cost more than 100 million US dollars to build. This cost advantage has earned DeepSeek the nickname “the Pinduoduo of AI,” reflecting its ability to deliver at scale without huge budgets.
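The two training figures quoted above can be cross-checked against each other. This short sketch divides the reported cost by the reported GPU hours to get the implied price of one H800 GPU hour, which works out to roughly two US dollars. It uses only the numbers in this article; the function name is illustrative.

```python
def cost_per_gpu_hour(total_cost_usd: float, gpu_hours: float) -> float:
    """Implied average cost of one GPU hour, in US dollars."""
    if gpu_hours <= 0:
        raise ValueError("gpu_hours must be positive")
    return total_cost_usd / gpu_hours


# Figures quoted in the article.
TRAINING_COST_USD = 5.6e6
H800_GPU_HOURS = 2.788e6

rate = cost_per_gpu_hour(TRAINING_COST_USD, H800_GPU_HOURS)
print(f"Implied rate: ${rate:.2f} per H800 GPU hour")
```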

V3.1 works smoothly with existing APIs, making integration simple for businesses. It is available through the company’s website, mobile app, and WeChat mini-program. The knowledge cut-off stands at July 2025. Online forums have started speculating that V3.1 could be followed by DeepSeek-R2, a reasoning-focused release expected in 2026.

DeepSeek V3.1 is already being viewed as more than a routine update. It signals a closing gap between open and closed systems. With stronger reasoning, more context, and reduced costs, V3.1 positions DeepSeek as a serious player in the AI race.


