Advanced AI News

DeepSeek shows enterprises model distillation opportunity

By Advanced AI Editor, August 8, 2025


Model distillation is one of the technology trends that has reached a level of maturity identified in Gartner’s 2025 Hype Cycle for artificial intelligence (AI) as “the slope of enlightenment”.

However, although it was thrust into the spotlight at the start of the year, when China’s DeepSeek demonstrated how model distillation can be used to train a large language model (LLM) that rivals models from OpenAI, it is not a new development. Haritha Khandabattu, senior director analyst at Gartner, said: “I was actually researching model distillation in 2017.”

In fact, the technique dates back to the 2006 Cornell University paper Model compression by Cristian Bucilă, Rich Caruana and Alexandru Niculescu-Mizil. Nine years later, in 2015, the paper Distilling the knowledge in a neural network by Geoffrey Hinton, Oriol Vinyals and Jeff Dean, posted to the Cornell University-hosted arXiv preprint server, used the term distillation to describe a technique for transferring the knowledge of a large model, or an ensemble of models, into a smaller one that is easier to deploy.
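
To make the idea concrete, the following is a minimal sketch of one common formulation of a distillation loss, in which a small "student" model is trained to match the temperature-softened output distribution of a larger "teacher" as well as the true label. The function names, the example logits, and the temperature and alpha values are illustrative assumptions, not details taken from the papers or from Gartner.

```python
import math


def softmax(logits, temperature=1.0):
    """Convert logits to probabilities, softened by a temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=2.0, alpha=0.5):
    """Weighted sum of a soft-target term and a hard-label term.

    temperature and alpha are hypothetical defaults chosen for illustration.
    """
    # Soft targets: teacher and student distributions at the same temperature.
    teacher_soft = softmax(teacher_logits, temperature)
    student_soft = softmax(student_logits, temperature)
    # KL divergence from the teacher's soft distribution to the student's.
    soft_term = sum(
        t * math.log(t / s)
        for t, s in zip(teacher_soft, student_soft)
        if t > 0
    )
    # Ordinary cross-entropy against the true label, at temperature 1.
    hard_term = -math.log(softmax(student_logits)[label])
    # Scaling by temperature squared keeps the soft term's gradients on a
    # comparable scale when the temperature changes.
    return alpha * temperature ** 2 * soft_term + (1 - alpha) * hard_term


# Made-up logits for a three-class problem where class 0 is correct.
teacher = [4.0, 1.0, 0.5]
close_student = [3.5, 1.2, 0.4]  # roughly agrees with the teacher
far_student = [0.5, 1.0, 4.0]    # disagrees with the teacher

print(distillation_loss(close_student, teacher, label=0))
print(distillation_loss(far_student, teacher, label=0))
```

A student whose outputs track the teacher's receives a lower loss than one that disagrees with it, which is the signal that lets the smaller model absorb the larger model's behaviour. In practice this loss would be computed by a deep learning framework over batches of training data rather than in plain Python.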

Although it is not considered a new technological development by Gartner, Khandabattu said: “Model distillation has been re-emphasised. The foundation models are compute hungry and extremely expensive to run, and enterprises have started asking how they can get 80% of the performance at 10% of the cost.”

She said DeepSeek has led to a downward trend in pricing over the past six to 12 months. But rather than adapt to these price changes, Khandabattu recommended that CIOs “plan their use cases and prioritise with the expectation that training and inference costs will continue to decline”.

Khandabattu said that even the large AI technology providers recognise the usefulness of model distillation to enable more deployable, more tunable and more governable AI, adding: “Model distillation is finally gaining commercial traction.”

She described model distillation as a bridge between innovation and scalability: “Model distillation unlocks both technical merit and access. It offers lower inference cost and IT infrastructure expenses are also a bit lower, which makes model distillation cost-effective for certain AI deployments.”

But Khandabattu also noted that there are other costs IT leaders need to consider beyond the IT infrastructure needed to run inference workloads. “CIOs need to be extremely careful and recognise that the total cost of deploying GenAI [generative AI] applications is not limited to the cost of the models.”

There are engineering costs and costs associated with integrating the AI system with enterprise IT, she said, adding: “Fine-tuning an AI model costs a lot of money. If the model provider decides to change the model, then you have to change all of the things that you’ve built on the older model to the newer one, which is very expensive.”

Beyond model distillation, she said: “With AI investment remaining strong this year, a sharper emphasis is being placed on using AI for operational scalability and real-time intelligence.”

According to Gartner, this has led to a gradual pivot from generative AI as a central focus, toward the foundational enablers that support sustainable AI delivery, such as AI-ready data and AI agents.

“Despite the enormous potential business value of AI, it isn’t going to materialise spontaneously,” said Khandabattu. “Success will depend on tightly business aligned pilots, proactive infrastructure benchmarking, and coordination between AI and business teams to create tangible business value.”

Among the AI innovations Gartner has forecast will reach mainstream adoption in the next five years are multimodal AI and AI trust, risk and security management (TRiSM).

Multimodal AI models are trained with multiple types of data simultaneously, such as images, video, audio and text. TRiSM is focused on layers of technical capabilities that support enterprise policies for all AI use cases and help assure AI governance, trustworthiness, fairness, safety, reliability, security, privacy and data protection. Gartner has predicted that, in combination, these developments will enable more robust, innovative and responsible AI applications, transforming how businesses and organisations operate.

Gartner also expects AI agents to be at least two to five years away from becoming mainstream.

“To reap the benefits of AI agents, organisations need to determine the most relevant business contexts and use cases, which is challenging given no AI agent is the same and every situation is different,” said Khandabattu. “Although AI agents will continue to become more powerful, they can’t be used in every case, so use will largely depend on the requirements of the situation at hand.”


