DeepSeek Aims To Share Tech Behind AI Inference Engine In Latest Open-source Push

DeepSeek is looking to make one of the core components of its AI models more open and accessible to other developers.

The Chinese AI startup said it will be sharing technical details about its internal inference engine with the open-source community. Inferencing is one of the many stages of building a large language model (LLM). It involves the trained AI model generating new data, which shows the patterns that the model has learned based on its parameters.

DeepSeek said that its internal inference engine and training framework have been instrumental in accelerating the training and deployment of its AI models. While its training framework is built upon the PyTorch platform, the startup’s inference engine is a modified version of vLLM, an open-source library for LLM inferencing that has been developed by researchers at UC Berkeley, United States.

Story continues below this ad

“Given the growing demand for deploying models like DeepSeek-V3 and DeepSeek-R1, we want to give back to the community as much as we can. We are deeply grateful for the open-source ecosystem, without which our progress toward AGI [artificial general intelligence] would not be possible,” a DeepSeek researcher’s note posted on Hugging Face, an online repository for open-source AI models.

However, the company is not making its internal inference engine fully open-source and accessible. Instead, DeepSeek said it will share the design improvements it made to the vLLM inference engine as well as details about its implementation, with existing open-source projects. It also committed to pulling out useful features and sharing them as standalone, reusable libraries with the open-source community.

DeepSeek identified certain stumbling blocks to making its inference engine fully open-source such as lack of maintenance bandwidth, infrastructural restrictions, and a heavily customised codebase. In February this year, DeepSeek made portions of its AI models such as code repositories open-source as part of its ‘open-source week’ initiative.

Beyond cost and compute efficiency, DeepSeek’s breakthrough was celebrated by AI researchers and tech executives for being open-source. However, its models do not fit the widely accepted definition of an open-source AI system provided by the Open Source Initiative (OSI). The data used to train its flagship R1 model as well as the training framework and training code have not been released under the permissive MIT licence.

Expand

Source link

What's Hot

GIST and MIT Launch Full-Scale Research on Human-Centered Physical AI Interaction

Distyl AI Raises $175M Series B At $1.8B Valuation, Up 9x From Last Funding

The Oakland Ballers let an AI manage the team. What could go wrong?

DeepSeek aims to share tech behind AI inference engine in latest open-source push | Technology News

Huawei Uses Its Own Chips to Retrain DeepSeek and Align Output With Beijing’s Standards

DeepSeek reports shockingly low training costs for R1 in new paper

DeepSeek warns its open-source AI models are vulnerable to ‘jailbreaking’

St. Patrick’s Cathedral Unveils Monumental Mural by Adam Cvijanovic

Three Loaned Banksy Works Incite Dispute Between England and Italy

Major Collection of Old Masters Paintings Could Be Fractionalized

100 Must-See Artworks at the Metropolitan Museum of Art

GIST and MIT Launch Full-Scale Research on Human-Centered Physical AI Interaction

Distyl AI Raises $175M Series B At $1.8B Valuation, Up 9x From Last Funding

The Oakland Ballers let an AI manage the team. What could go wrong?

What's Hot

DeepSeek aims to share tech behind AI inference engine in latest open-source push | Technology News

Related Posts

Subscribe to Updates