Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

Multiple-Scattering Microfacet BSDFs with the Smith Model

EU won’t delay AI law rollout despite tech industry’s pushback

‘I’ll fight to keep every one of you’: OpenAI executive Mark Chen pushes back as Meta poaches AI talent

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • Amazon (Titan)
    • Anthropic (Claude 3)
    • Cohere (Command R)
    • Google DeepMind (Gemini)
    • IBM (Watsonx)
    • Inflection AI (Pi)
    • Meta (LLaMA)
    • OpenAI (GPT-4 / GPT-4o)
    • Reka AI
    • xAI (Grok)
    • Adobe Sensi
    • Aleph Alpha
    • Alibaba Cloud (Qwen)
    • Apple Core ML
    • Baidu (ERNIE)
    • ByteDance Doubao
    • C3 AI
    • DataRobot
    • DeepSeek
  • AI Research & Breakthroughs
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Education AI
    • Energy AI
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Media & Entertainment
    • Transportation AI
    • Manufacturing AI
    • Retail AI
    • Agriculture AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
Facebook X (Twitter) Instagram
Advanced AI News
DeepSeek

Deepseek R1-0528: German Firm Releases Version of DeepSeek’s AI Model That Runs Twice as Fast

Advanced AI EditorBy Advanced AI EditorJuly 5, 2025No Comments4 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


German IT firm TNG Technology Consulting has released a new open-source AI model that is reportedly twice as fast as the DeepSeek R1-0528 variant from May it is based on. Released this week on the Hugging Face platform, DeepSeek-TNG R1T2 Chimera achieves its remarkable efficiency through a novel ‘Assembly-of-Experts’ technique.

This method merges components from three different parent models, including the original DeepSeek R1 and V3 models. The result is a model that retains high-level reasoning capabilities while generating answers with 60% fewer tokens, drastically cutting inference costs and response times for developers.

The AI developer community has responded with enthusiasm. On X, Hugging Face senior leader Vaibhav Srivastav wrote, “DAMN! DeepSeek R1T2 – 200% faster than R1-0528 & 20% faster than R1,” highlighting its performance gains. The model is available under a permissive MIT License, allowing for broad commercial use and modification.

Assembly-of-Experts: A Novel Approach to Model Creation

TNG’s “Assembly-of-Experts” (AoE) method represents a significant departure from conventional model creation. Instead of fine-tuning or retraining, AoE builds a new model by selectively merging the weight tensors from multiple pre-trained parents, a process detailed in a recent research paper from June.

The implementation focuses on merging the routed expert tensors—the parts of a model most responsible for specialized knowledge—while retaining the more efficient shared layers from faster parents. This “Tri-Mind” Chimera combines the reasoning of R1-0528, structured thought of R1, and conciseness of V3-0324.

DeepSeek-TNG R1T2 Chimera intelligence_score_vs_output_tokens

This approach is distinct from the Mixture-of-Experts (MoE) architecture used in its parent models. While MoE is a runtime architecture that activates a fraction of a model’s “experts” for any given task, AoE is a construction technique that bakes the combined expertise into a single, more efficient final model.

Benchmarks: Balancing Raw Intelligence with Extreme Efficiency

The practical benefit of this technique is a powerful balance of intelligence and speed. According to benchmarks published by TNG, R1T2 Chimera achieves between 90% and 92% of the reasoning performance of its most powerful parent, R1-0528, on demanding tests like AIME and GPQA.

These benchmarks are designed to test sophisticated, multi-step reasoning that goes far beyond simple knowledge recall. However, the model’s key advantage is conciseness. It generates correct answers using approximately 40% of the tokens required by R1-0528, a 60% reduction in output length.

This directly translates to faster response times and lower compute costs, making it over twice as fast in practical terms. This efficiency was a hallmark of its V3 parent. After its March release, developer Awni Hannun said of the improved March 2025 variant of V3, “it’s the most powerful model I’ve ever run on my laptop,” after running it on his laptop. R1T2 Chimera successfully grafts this efficiency onto a stronger reasoning core.

An Innovation Amid Geopolitical and Corporate Headwinds

The release of this highly efficient model comes at a turbulent time for its original creator, DeepSeek AI. The Chinese firm’s momentum has stalled, with its anticipated R2 model now indefinitely delayed. This is due to both internal performance dissatisfaction and the impact of US export controls on vital AI chips.

Simultaneously, DeepSeek faces mounting regulatory pressure in the West. In Germany, Berlin’s data protection authority has requested Apple and Google remove the DeepSeek app from stores, labeling it “unlawful content” due to illegal data transfer risks to China.

This follows a damning April report from the US House Select Committee on the CCP. Committee Chairman John Moolenaar stated, “this report makes it clear: DeepSeek isn’t just another AI app — it’s a weapon in the Chinese Communist Party’s arsenal…,” alleging the app is a tool for espionage and data harvesting. These external pressures create a complex backdrop for any technology derived from DeepSeek’s work.

Enterprise Deployment: Availability, Licensing, and Limitations

For enterprise technical leaders, R1T2 Chimera presents a compelling option. Its MIT license offers maximum flexibility for private hosting, customization, and deployment in commercial applications without licensing fees. The significant reduction in inference cost makes it ideal for high-throughput or real-time environments.

The cost savings are particularly relevant for applications like real-time customer support chatbots, large-scale document summarization, or internal knowledge base queries, where both speed and budget are critical. It places the model in a desirable quadrant on the performance-versus-cost curve.

However, TNG notes some current limitations. The model is not yet recommended for use cases requiring function calling or tool use, meaning it cannot reliably interact with external APIs. This limits its use in complex, automated workflows, though future updates may address this gap.

Furthermore, the company advises European users to assess their compliance with the EU AI Act, which has extraterritorial reach. Despite these caveats, the release of R1T2 Chimera by TNG marks a notable step in modular AI development, offering a glimpse into a future where models are assembled, not just trained.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleGoogle faces EU antitrust complaint over AI Overviews
Next Article Ford CEO Jim Farley warns AI will wipe out half of white-collar jobs, but the ‘essential economy’ has a huge shortage of workers
Advanced AI Editor
  • Website

Related Posts

DeepSeek’s LinkedIn AI job listings show hunger for international Chinese talent

July 4, 2025

China’s open-source AI push expands after DeepSeek, as Baidu and Huawei launch new models

July 4, 2025

Why DeepSeek AI has the tech world on red alert

July 3, 2025
Leave A Reply Cancel Reply

Latest Posts

Albright College is Selling Its Art Collection to Balance Its Books

Big Three Auction Houses Hold Old Masters Sales in London This Week

MFA Boston Returns Two Works to Kingdom of Benin

Tate’s £150M Endowment Campaign May Include Turbine Hall Naming Rights

Latest Posts

Multiple-Scattering Microfacet BSDFs with the Smith Model

July 6, 2025

EU won’t delay AI law rollout despite tech industry’s pushback

July 6, 2025

‘I’ll fight to keep every one of you’: OpenAI executive Mark Chen pushes back as Meta poaches AI talent

July 6, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Multiple-Scattering Microfacet BSDFs with the Smith Model
  • EU won’t delay AI law rollout despite tech industry’s pushback
  • ‘I’ll fight to keep every one of you’: OpenAI executive Mark Chen pushes back as Meta poaches AI talent
  • Google DeepMind’s Deep Q-Learning & Superhuman Atari Gameplays | Two Minute Papers #27
  • Are We Living In a Computer Simulation? | Two Minute Papers #28

Recent Comments

No comments to show.

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

YouTube LinkedIn
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.