DeepSeek, the China-based AI lab, released DeepSeek-V3.2-Exp, an experimental AI model, on September 29. The company claims the model achieves ‘significant efficiency improvements in both training and inference’.
It is built upon DeepSeek-V3.1-Terminus, which is itself an upgraded version of the DeepSeek-V3.1 model.
The model introduces DeepSeek Sparse Attention (DSA), a sparse attention mechanism designed to explore and validate optimisations for training and inference efficiency in long-context scenarios, according to the company.
Despite using a much simpler and faster attention method that processes far fewer tokens during long-context tasks, the model performs on par with V3.1-Terminus, DeepSeek said.
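DeepSeek’s announcement does not spell out how DSA selects which tokens to attend to, but the general idea behind sparse attention is that each query looks at only a small subset of keys rather than all of them. A minimal, illustrative NumPy sketch of one common variant (top-k key selection per query; the function name and sizes are hypothetical and not DeepSeek’s actual implementation):

```python
import numpy as np

def topk_sparse_attention(q, K, V, k=64):
    """For one query vector, attend only to the k highest-scoring keys
    instead of all n of them -- the core idea behind sparse attention.
    q: (d,), K: (n, d), V: (n, d). Returns a (d,) output vector."""
    scores = K @ q / np.sqrt(q.shape[0])         # score every key against the query
    top = np.argpartition(scores, -k)[-k:]       # keep indices of the k best keys
    w = np.exp(scores[top] - scores[top].max())  # softmax over the selected keys only
    w /= w.sum()
    return w @ V[top]                            # weighted sum of k values, not n

# Hypothetical long-context example: attend to 64 of 4,096 positions
rng = np.random.default_rng(0)
q, K, V = rng.normal(size=128), rng.normal(size=(4096, 128)), rng.normal(size=(4096, 128))
out = topk_sparse_attention(q, K, V, k=64)
```

Because the softmax and weighted sum run over k positions instead of the full sequence length, the per-query cost stops growing with context length once the selection step is cheap, which is what makes this family of methods attractive at 128k-token contexts.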
For context, the model scored 58 on the Artificial Analysis Intelligence Index, which aggregates an AI model’s performance across 10 benchmarks in diverse domains. Anthropic’s Claude 4.1 Opus scores 59, Google’s Gemini 2.5 Pro scores 60, and OpenAI’s GPT-5 (high) scores 68.
For more details on the architecture, refer to the technical report DeepSeek released alongside the model.
“The DeepSeek team cracked cheap long context for LLMs: a ~3.5x cheaper prefill and ~10x cheaper decode at 128k context at inference with the same quality,” said Deedy Das, partner at Menlo Ventures, reacting to the announcement on X.
The model is available on the DeepSeek app, web and API, and its weights are available on Hugging Face.
The company also announced that API prices have been cut by 50% or more. DeepSeek has reduced input costs from $0.07 to $0.028 per 1M tokens for cache hits and from $0.56 to $0.28 for cache misses, while output costs have dropped from $1.68 to $0.42 per 1M tokens.
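At the new rates, a quick back-of-the-envelope check of what a single call costs (the request sizes below are made-up illustrative numbers, not from DeepSeek):

```python
# USD per 1M tokens at the new DeepSeek-V3.2-Exp API prices quoted above
PRICES = {"input_cache_hit": 0.028, "input_cache_miss": 0.28, "output": 0.42}

def request_cost(hit_tokens: int, miss_tokens: int, output_tokens: int) -> float:
    """Total cost in USD for one API call."""
    return (hit_tokens * PRICES["input_cache_hit"]
            + miss_tokens * PRICES["input_cache_miss"]
            + output_tokens * PRICES["output"]) / 1_000_000

# Hypothetical long-context call: 100k cached + 28k uncached input, 4k output
print(f"${request_cost(100_000, 28_000, 4_000):.4f}")  # -> $0.0123
```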
“This experimental release represents our ongoing research into more efficient transformer architectures, particularly focusing on improving computational efficiency when processing extended text sequences,” DeepSeek said in a blog post announcing the release.