Advanced AI News

Distilled AI runs on a single GPU

By Advanced AI Bot, May 30, 2025

The next big thing from DeepSeek isn’t here yet. That’s DeepSeek R2, which is in development and should bring notable performance improvements. But like OpenAI, Google, and other AI firms, the Chinese startup continues to upgrade the models it released publicly in recent months.

DeepSeek R1 is one of those models. It's a reasoning AI that DeepSeek released in early 2025, rattling AI-related stocks. In case you forgot, DeepSeek managed to train a frontier AI model on par with OpenAI's o1 without access to the latest Nvidia hardware used by US AI firms.

DeepSeek relied on software innovations to make up for its hardware limitations, and DeepSeek R1 became a hit AI app overnight. The company also released its AI models as open source, allowing users to install them on their own devices and run them locally without an internet connection.

Open-sourcing DeepSeek helped its AI models spread even faster. At the same time, access to an open-source version of DeepSeek R1 helps prevent user data from reaching Chinese servers and lets researchers bypass some of the built-in censorship found in web and mobile apps.

While I’ve advised caution when using AI models that involve heavy censorship or send user data to places like China, it’s ultimately your choice which models you want to use regularly. 

If you’re a fan of the DeepSeek experience, you’ll be glad to know the Chinese startup just upgraded the R1 model and released a smaller, distilled version that only needs one GPU to run.

This week, DeepSeek released the updated R1 model on Hugging Face, a platform well known in the AI world for offering a variety of new tools, including unreleased chatbots that are still in testing.

While DeepSeek hasn’t shared many details about the new R1 model, we know it has 685 billion parameters. That’s a large model requiring substantial resources to run. As TechCrunch explains, the full-size R1 needs around a dozen 80GB GPUs to run locally.

The updated model is expected to deliver better performance and reduce hallucinations, according to a post on WeChat. A similar description is available on DeepSeek’s website, although the company didn’t promote this release as heavily as before.

“The model has demonstrated outstanding performance across various benchmark evaluations, including mathematics, programming, and general logic,” DeepSeek said, per Reuters.

The smaller version of R1 is even more exciting. The model name, DeepSeek-R1-0528-Qwen3-8B, indicates that it is a reasoning model released on May 28 and built on Qwen3-8B, the 8-billion-parameter model that Alibaba introduced in April. It is available on Hugging Face.

Alibaba is one of a growing number of Chinese AI firms launching high-end models that directly compete with ChatGPT, Claude, and other US-developed AIs.

DeepSeek fine-tuned Qwen3-8B on text generated by the newly upgraded R1 model, creating the distilled version of R1.
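As a rough illustration of what distillation of this kind involves, the sketch below pairs prompts with a larger "teacher" model's answers to form a fine-tuning dataset for a smaller "student" model. DeepSeek has not published its pipeline in this article, so everything here is an assumption: `build_distillation_set`, `fake_teacher`, and the record layout are hypothetical names, and the teacher is a stub rather than a real model call.

```python
from typing import Callable, Dict, List


def build_distillation_set(
    prompts: List[str],
    teacher_generate: Callable[[str], str],
) -> List[Dict[str, str]]:
    """Pair each prompt with the teacher's generated answer.

    The resulting records are ordinary supervised fine-tuning examples:
    the student model is later trained to reproduce the teacher's text.
    """
    records = []
    for prompt in prompts:
        completion = teacher_generate(prompt)
        # Skip blank generations so the student never trains on empty text.
        if completion.strip():
            records.append({"prompt": prompt, "completion": completion})
    return records


def fake_teacher(prompt: str) -> str:
    # Hypothetical stand-in for a call to the large teacher model.
    return f"reasoning about: {prompt} ... final answer"


dataset = build_distillation_set(["What is 2 + 2?", "Is 17 prime?"], fake_teacher)
print(len(dataset))
print(dataset[0])
```

In a real setup the stub would be replaced by calls to the large model, and the resulting records would feed a standard fine-tuning run on the smaller model.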

As a reminder, DeepSeek stirred controversy when R1 debuted, with OpenAI accusing the startup of using ChatGPT data without permission to speed up R1’s training. OpenAI itself has also faced accusations of using data from sources without proper authorization for training its models.

What stands out about DeepSeek-R1-0528-Qwen3-8B is that it runs on a single GPU with 40GB to 80GB of memory, such as Nvidia's H100. Compared with the roughly dozen 80GB GPUs the full-size R1 needs, that makes it far easier for AI hobbyists and developers to experiment with DeepSeek R1 locally.

The hardware requirements are impressive, especially given the power of the distilled DeepSeek R1 model.
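A back-of-the-envelope calculation shows why the two models land in such different hardware classes. The sketch below estimates the memory taken by the weights alone; it ignores the KV cache, activations, and framework overhead, and the bytes-per-parameter values are assumptions, since the article does not say which numeric precision either model is served in. The helper names are made up for illustration.

```python
import math

# Assumed storage cost per parameter for common numeric formats.
BYTES_PER_PARAM = {"fp32": 4.0, "bf16": 2.0, "fp8": 1.0, "int4": 0.5}


def weight_gb(params_billion: float, precision: str) -> float:
    """Memory needed for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params_billion * BYTES_PER_PARAM[precision]


def min_gpus(params_billion: float, precision: str, gpu_gb: float = 80.0) -> int:
    """Smallest number of GPUs whose combined memory holds the weights."""
    return math.ceil(weight_gb(params_billion, precision) / gpu_gb)


# Full-size R1: 685 billion parameters, assumed 1 byte per parameter.
print(weight_gb(685, "fp8"), min_gpus(685, "fp8"))

# Distilled model: 8 billion parameters, assumed 2 bytes per parameter.
print(weight_gb(8, "bf16"), min_gpus(8, "bf16", gpu_gb=40.0))
```

Under these assumptions the full model's weights occupy 685GB, which already takes nine 80GB GPUs before any working memory is counted, so "around a dozen" is plausible. The 8-billion-parameter model's weights come to 16GB, leaving headroom on a single 40GB card.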

Despite being a smaller version, this R1 model is performing well in benchmarks. DeepSeek-R1-0528-Qwen3-8B has outperformed Google’s Gemini 2.5 Flash in AIME 2025, a series of tough math problems.

The smaller DeepSeek R1 also nearly matches Microsoft’s Phi 4 reasoning model in HMMT math tests.

The only way to use the smaller R1 model, though, is by installing it on your own computer.


