Jointly developed by open-source initiative Agentica and San Francisco-based start-up Together AI, DeepSWE was trained on the Qwen3-32B large language model (LLM) – part of Alibaba Cloud’s third-generation family of AI models. It topped the leaderboard of the latest SWEBench-Verified test, scoring 59 per cent accuracy against other so-called open-weight models like DeepSeek’s V3-0324, the developers said in a blog post on Wednesday.
Agentic frameworks are software platforms that provide the structure, tools and functionalities to build, deploy and manage AI agents. They enable AI agents to collaborate, make decisions and automate complex tasks.
AI agents, such as Chinese start-up Butterfly Effect’s Manus, are software programs that are capable of autonomously performing tasks on behalf of a user or another system. Essentially, these agents create a plan of specific tasks and subtasks to complete a goal using available resources.
DeepSWE marks the latest example of Hangzhou-based Alibaba’s growing leadership position in the global open-source community. Alibaba owns the South China Morning Post.
The open-source approach gives public access to a program’s source code, allowing third-party software developers to modify or share its design, fix broken links or scale up its capabilities.
