Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

IBM vs. Amazon: Which Cloud Infrastructure Stock Offers More Upside? – July 15, 2025

Perplexity’s Comet is here, and after using it for 48 hours I’m convinced AI web browsers are the future of the internet

AWS doubles investment in AWS Generative AI Innovation Center, marking two years of customer success

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Industry AI
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
DeepSeek

China’s Kimi K2 Could Be the Next DeepSeek Moment

By Advanced AI EditorJuly 15, 2025No Comments6 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


China’s open-source AI scene is heating up again. After DeepSeek’s rapid rise earlier this year, a new challenger is making waves in the form of Kimi K2 from Moonshot AI.

Although it launches with less fanfare, Kimi K2 is now drawing serious attention from AI insiders and outperforming some of the biggest names in the game.

It’s fast, climbing the ranks, beating expectations on benchmarks, and sparking comparisons to DeepSeek’s breakout moment. Some even believe it’s strong enough to have made OpenAI rethink its release schedule.

“China’s Kimi K2 is having its mini DeepSeek moment: it is now #14 on OpenRouter today, ahead of Grok 4 and GPT-4.1,” Deedy Das of Menlo Ventures wrote in a post on X

He added that this is a non-reasoning model, yet it scores highest on major EQ and creative writing benchmarks. “Best model smell since (Claude) 3.5 Sonnet,” he said.

Based on current API pricing, Kimi K2 is roughly 80-90% cheaper than Claude Sonnet 4 when comparing per-token costs, especially for API usage.

The model is now available in preview on GroqCloud at 185 tokens per second. 

Kimi K2 uses a sparse mixture‑of‑experts (MoE) design, featuring one trillion total parameters and 32 billion active ones per query. Of its 384 specialised expert subnetworks, only a few are activated dynamically based on the input. This setup lowers compute needs while preserving capacity. It also supports a 1,28,000-token context window. 

As soon as the model was dropped, OpenAI CEO Sam Altman announced a delay in the release of their open-source model.

“Kimi mogged OpenAI, and I genuinely think the real reason they delayed the open-source model release is Kimi K2,” AI enthusiast Ashutosh Shrivastava wrote on X. He added that OpenAI “never saw this coming”. Kimi K2 outperforms DeepSeek V3 and goes head-to-head with Claude Opus 4 and GPT-4.1.

This comes against the backdrop of OpenAI naming another Chinese AI startup, Zhipu, as a potential threat to its dominance.

People I respect are speculating that OpenAI is pushing back their open-source release due to Kimi K2. It does strike me that this is what Llama 4 was supposed to be; a massive, impressive open-source MoE model that can form the basis for a new generation of agentic AI… pic.twitter.com/CxP4m6Lp38

— Chris Paxton (@chris_j_paxton) July 12, 2025

Kimi K2 delivered top-tier results in coding and math benchmarks. On SWE-bench Verified, it scored 65.8%, outperforming GPT-4.1 at 54.6% and coming close to Claude Sonnet 4. On LiveCodeBench, it achieved 53.7%, ahead of DeepSeek V3 (46.9%) and GPT-4.1 (44.7%).

In the Math-500 benchmark, it scored 97.4%, compared to GPT-4.1’s 92.4%. Kimi K2 also performs strongly across AIME, GPQA, OGBench, and tool-use evaluations.

Artificial Analysis said that while Moonshot AI’s Kimi K2 is the leading open-weight non-reasoning model in its Intelligence Index, it outputs roughly three times more tokens than other non-reasoning models, blurring the line between reasoning and non-reasoning.

As a non-reasoning model, it excels in creative tasks. It is now the Short-Story Creative Writing champion, scoring 8.56 and surpassing the previous leader, o3-pro, which scored 8.44.

Kimi-K2-Instruct now ranks #1 on EQ-Bench 3, a benchmark for emotional intelligence in LLMs. It leads GPT-4o, Claude, and Gemini across empathy, insight, and creative writing. pic.twitter.com/91amc3W9wB

— 👋 Jan (@jandotai) July 14, 2025

Agentic Capabilities 

Kimi K2 has good agentic capabilities. According to the company, unlike traditional LLMs, Kimi K2 can plan and execute multi‑step tasks autonomously. It can call external APIs, generate and debug code, create plots, webpages and more, all without manual prompting at each step. 

Kimi K2 one-shotted a web version of 3D Minecraft!

K2 is meticulously optimized for agentic capabilities. Designed for tool use and autonomous problem-solving.

It automatically understands how to use the tools and gets the job done. You don’t have to write any complex workflow… https://t.co/mmw5qlesJC pic.twitter.com/yHMS9A1YAN

— cedric (@cedric_chee) July 11, 2025

There are two versions of the model. While the Base variant is designed for research and fine-tuning, the Instruct variant is intended for use in chatbots and agents.

In a blog post, the company shared that Kimi K2’s agentic abilities are driven by two core components: large-scale tool-use training and general reinforcement learning (RL).

In order to teach the model how to use tools effectively, Moonshot AI built a large-scale synthetic data pipeline inspired by ACEBench. This system simulates real-world tool-use tasks across hundreds of domains and thousands of tools, combining both real and synthetic examples. 

“Our approach systematically evolves hundreds of domains containing thousands of tools, including both real MCP (Model Context Protocol) tools and synthetic ones, then generates hundreds of agents with diverse tool sets,” the company said.

It Comes with Flaws

Despite the good benchmark figures, Ethan Mollick, a professor at Wharton, described Kimi K2 as “a really weird model” that still needs much more testing. He recounted an experiment where he gave it a slightly altered version of the novel The Great Gatsby. 

Like Claude, the model spotted the two intentional changes, but then “made up a ton of hallucinated nonsense that sounded plausible but was just plain wrong”.

He added that the DeepSeek moment was largely fueled by pent-up consumer demand for high-quality free AI, especially among students looking for help with homework.

According to him, Kimi K2, despite its strong performance, hasn’t seen the same immediate public impact. One possible reason he observed is that for most consumers and students, “DeepSeek is good enough”.

“​​Feels like unlike DeepSeek, the general public hasn’t felt the effect/impacts of Kimi K2 yet – most non-technical people have probably never even heard of it. Wonder why it is being overlooked when DeepSeek got so much attention,” wrote a user on X.

Meanwhile, DeepSeek’s upcoming model, R2, is still unreleased, and it may be delayed further. A recent report suggests that US export restrictions on NVIDIA’s H20 chips, which are essential for training and deploying the model, could pose serious challenges in China.

Kimi K2 may not have the same hype DeepSeek had, but its performance is hard to ignore. With strong benchmarks and growing visibility, it is clear that China’s open-source push is far from over.





Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleAmazon-backed Anthropic rolls out Claude AI for financial services – NBC New York
Next Article AI To Cut Expenses Can Boost Sales Through Better Customer Service
Advanced AI Editor
  • Website

Related Posts

AI Showdown: Grok 3, Grok 4, ChatGPT, Gemini and DeepSeek — Which AI Wins for SA Creators?

July 15, 2025

How DeepSeek is upending AI innovation and investment

July 14, 2025

OpenAI delays its first open-source AI model challenging DeepSeek

July 14, 2025

Comments are closed.

Latest Posts

The Artists and Art Pros Who Donated to Cuomo and Mamdani’s Campaigns

Phillips Sues Billionaire’s Son Over $14.5 M. Pollock Painting

Murujuga Rock Art in Australia Receives UNESCO World Heritage Status

‘Earth Room’ Caretaker Dies at 70

Latest Posts

IBM vs. Amazon: Which Cloud Infrastructure Stock Offers More Upside? – July 15, 2025

July 15, 2025

Perplexity’s Comet is here, and after using it for 48 hours I’m convinced AI web browsers are the future of the internet

July 15, 2025

AWS doubles investment in AWS Generative AI Innovation Center, marking two years of customer success

July 15, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • IBM vs. Amazon: Which Cloud Infrastructure Stock Offers More Upside? – July 15, 2025
  • Perplexity’s Comet is here, and after using it for 48 hours I’m convinced AI web browsers are the future of the internet
  • AWS doubles investment in AWS Generative AI Innovation Center, marking two years of customer success
  • Varun Mohan Joins Google as Cognition Acquires Windsurf After OpenAI Deal Collapse
  • Former OpenAI CTO Mira Murati raises $2 billion for new AI startup Thinking Machines Lab – NBC New York

Recent Comments

  1. ⛏ Ticket- Operation 1,208189 BTC. Assure => https://graph.org/Payout-from-Blockchaincom-06-26?hs=53d5900f2f8db595bea7d1d205d9c375& ⛏ on Were RNNs All We Needed? (Paper Explained)
  2. 📗 + 1.333023 BTC.NEXT - https://graph.org/Payout-from-Blockchaincom-06-26?hs=ec6999251b5fd7a82cd3e6db8f19412e& 📗 on OpenAI is pushing for industry-specific AI benchmarks – why that matters
  3. 📏 + 1.602160 BTC.NEXT - https://graph.org/Payout-from-Blockchaincom-06-26?hs=68a63a7dd7346634ec406c95aa051292& 📏 on [News] Soccer AI FAILS and mixes up ball and referee’s bald head.
  4. 🖱 Reminder; + 1.859736 bitcoin. Get >>> https://graph.org/Payout-from-Blockchaincom-06-26?hs=ea1f0b9078972b08ef3081fd29f37328& 🖱 on Meta has revenue sharing agreements with Llama AI model hosts, filing reveals
  5. 🔐 Email: TRANSFER 1,339860 BTC. Assure => https://graph.org/Payout-from-Blockchaincom-06-26?hs=636969481c537edfddb345a6023fc080& 🔐 on James Cameron Wants to Use AI to ‘Cut the Cost’ of Making Films

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.