Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

What Do We Want From Legal AI? – Artificial Lawyer

Paper page – Music Arena: Live Evaluation for Text-to-Music

Anthropic Sets Weekly Limits on Claude AI to Curb Misuse, Maintain Reliability

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Industry AI
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
DeepSeek

Chinese startup Z.ai releases cost-efficient GLM-4.5 reasoning model

By Advanced AI EditorJuly 29, 2025No Comments4 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Chinese startup Z.ai today open-sourced GLM-4.5, a reasoning model that it claims is more cost-efficient than DeepSeek’s R1.

CNBC reported that the algorithm can run on eight H20 graphics cards. The H20 is a scaled-down version of Nvidia Corp.’s H100 chip, which was its flagship artificial intelligence accelerator until last year. The U.S. government recently greenlit the sale of the former processor to companies in China.

The launch of GLM-4.5 comes about six months after DeepSeek released its open-source R1 reasoning model. At the time, the company stated that the algorithm can perform some tasks using 50 times less hardware than OpenAI’s o1. Furthermore, DeepSeek claimed to have trained its model for a fraction of the cost of earlier AI projects.

R1’s release led to investor concerns that increasingly hardware-efficient language models may lower demand for AI infrastructure. Nvidia’s market capitalization dropped more than $580 billion in the subsequent selloff, setting a new Wall Street record. The release of GLM-4.5 today didn’t lead to a similar drop in AI stocks, but it sends investors another signal that reasoning models are continuing to become more hardware-efficient.

Z.ai reportedly expects to charge 11 cents for every 1 million input tokens entered into GLM-4.5. That’s three cents lower than R1. One million output tokens cost 28 cents, just over one-10th what DeepSeek charges for R1.

One of the main factors behind GLM-4.5’s cost efficiency is that it’s relatively small. The model features 355 billion parameters, or about 316 million less than R1. GLM-4.5 only activates 32 billion of those parameters at any given time to reduce hardware usage.

An AI model comprises numerous code snippets called artificial neurons that each perform a tiny portion of the work involved processing a prompt. Those neurons, in turn, are organized into so-called layers. Z.ai removed some of GLM-4.5’s components to add more layers, an approach that it says helped boost the model’s reasoning skills.

The company trained GLM-4.5 through a multistep workflow. First, it developed an initial version of the model using a dataset that included 15 trillion tokens’ worth of information. Z.ai then honed GLM-4.5’s reasoning skills with several smaller training datasets that together comprised more than 7 trillion tokens. 

The company evaluated the model’s capabilities using a dozen popular AI benchmarks. According to Z.ai, GLM-4.5 outperformed multiple popular alternatives including Claude 4 Opus. It ranked third behind xAI Holdings Corp.’s Grok 4 and OpenAI’s o3.

For use cases that place particular emphasis on cost-efficiency, Z.ai has developed a scaled-down version of its model called GLM-4.5-Air. The algorithm features 106 billion parameters, or about three times less than the original. GLM-4.5-Air activates 12 billion parameters to process prompts.

In January, the U.S. Commerce Department added Z.ai to its Entity List of organizations subject to export controls. The company is backed by $1.5 billion in funding from Alibaba Group, Tencent Inc. and other investors. It reportedly plans to file for a public offering later this year. 

Image: Unsplash

Support our open free content by sharing and engaging with our content and community.

Join theCUBE Alumni Trust Network

Where Technology Leaders Connect, Share Intelligence & Create Opportunities

11.4k+  

CUBE Alumni Network

C-level and Technical

Domain Experts

Connect with 11,413+ industry leaders from our network of tech and business leaders forming a unique trusted network effect.

SiliconANGLE Media is a recognized leader in digital media innovation serving innovative audiences and brands, bringing together cutting-edge technology, influential content, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — such as those established in Silicon Valley and the New York Stock Exchange (NYSE) — SiliconANGLE Media operates at the intersection of media, technology, and AI. .

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a powerful ecosystem of industry-leading digital media brands, with a reach of 15+ million elite tech professionals. The company’s new, proprietary theCUBE AI Video cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleOpenAI prepares GPT-5 for roll out
Next Article Alibaba-backed Moonshot releases new Kimi AI model that beats ChatGPT, Claude in coding — and it costs less – NBC 6 South Florida
Advanced AI Editor
  • Website

Related Posts

China’s AI monster is here and it’s coming for DeepSeek’s throne

July 28, 2025

Chinese universities want students to use more AI, not less

July 28, 2025

The DeepSeek moment: China’s economic threat

July 28, 2025

Comments are closed.

Latest Posts

Picasso’s ‘Demoiselles’ May Not Have Been Inspired by African Art

Catalan National Assembly protested the restitution of murals to Aragon.

UNESCO Adds 26 Sites to World Heritage List

Aspen Art Fair Doubles in Size for 2025 Edition

Latest Posts

What Do We Want From Legal AI? – Artificial Lawyer

July 29, 2025

Paper page – Music Arena: Live Evaluation for Text-to-Music

July 29, 2025

Anthropic Sets Weekly Limits on Claude AI to Curb Misuse, Maintain Reliability

July 29, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • What Do We Want From Legal AI? – Artificial Lawyer
  • Paper page – Music Arena: Live Evaluation for Text-to-Music
  • Anthropic Sets Weekly Limits on Claude AI to Curb Misuse, Maintain Reliability
  • Jim Cramer Notes IBM Stock Sell-Off Despite Strong Earnings
  • Tesla signs $16.5B deal with Samsung to make AI chips

Recent Comments

  1. binance kód on Anthropic closes $2.5 billion credit facility as Wall Street continues plunging money into AI boom – NBC Los Angeles
  2. 🖨 🔵 Incoming Message: 1.95 Bitcoin from exchange. Claim transfer => https://graph.org/ACTIVATE-BTC-TRANSFER-07-23?hs=40f06aae45d2dc14b01045540f836756& 🖨 on SFC Dialogue丨Jeffrey Sachs says he uses DeepSeek every hour_to_facts_its
  3. 📪 ✉️ Unread Notification: 1.65 BTC from user. Claim transfer >> https://graph.org/ACTIVATE-BTC-TRANSFER-07-23?hs=63f0a8159ef8316c31f5a9a8aca50f39& 📪 on Sean Carroll: Arrow of Time
  4. 🔋 📬 Unread Alert - 1.65 BTC from exchange. Accept funds > https://graph.org/ACTIVATE-BTC-TRANSFER-07-23?hs=db3ef91843302da628b83636ef7db949& 🔋 on Rohit Prasad: Amazon Alexa and Conversational AI | Lex Fridman Podcast #57
  5. 📟 ✉️ New Alert: 1.95 Bitcoin from partner. Review funds => https://graph.org/ACTIVATE-BTC-TRANSFER-07-23?hs=945d7d4685640a791a641ab7baaf111d& 📟 on OpenAI’s $3 Billion Windsurf Acquisition Changes AI Forever

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.