Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

A New Trick Could Block the Misuse of Open Source AI

C3 AI Stock Is Soaring Today: Here’s Why – C3.ai (NYSE:AI)

Nvidia Faces $8B Hit as U.S. Halts H20 AI Chip Exports to China

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • Adobe Sensi
    • Aleph Alpha
    • Alibaba Cloud (Qwen)
    • Amazon AWS AI
    • Anthropic (Claude)
    • Apple Core ML
    • Baidu (ERNIE)
    • ByteDance Doubao
    • C3 AI
    • Cohere
    • DataRobot
    • DeepSeek
  • AI Research & Breakthroughs
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Education AI
    • Energy AI
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Media & Entertainment
    • Transportation AI
    • Manufacturing AI
    • Retail AI
    • Agriculture AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
Advanced AI News
Home » IBM z17 brings multi-model AI to transaction processing • The Register
IBM

IBM z17 brings multi-model AI to transaction processing • The Register

Advanced AI BotBy Advanced AI BotJanuary 30, 2025No Comments5 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


IBM’s latest mainframe builds on the platform’s traditional attributes of security and reliability for mission-critical workloads, adding AI to support large language models (LLMs), assistants, and agents.

ibm newest mainframe system ibm z17

The new z17 mainframe system is available from June

The z17 family introduces an improved Telum II processor and Spyre AI Accelerator card, both of which were discussed at the Hot Chips conference in Palo Alto last year, for a claimed speed bump of 7.5 times the AI performance of the z16.

While the Telum II offers improved AI inferencing for running fraud detection checks against transactions – as was introduced with the z16 – the Spyre cards provide a way to scale AI handling to support generative AI and LLMs, and the use of multiple models to improve accuracy and reduce false positives, IBM claims.

“If you look at data as the new fuel, then infrastructure is the engine that allows organizations to drive their AI journeys to success,” said Elpida Tzortzatos, IBM Fellow and IBM Z Architect, referring to the hardware enhancements Big Blue has developed for this latest big iron.

The firm says it has spent a lot of time talking to clients about what they wanted to see in the mainframe, and this informed the development of the z17. Modernizing their applications and enabling mainframes to be more AI-driven is apparently what the customers told them.

ibm Spyre and Telum II chips

The Spyre and Telum II chips

But it isn’t a case of just throwing generative AI into the mix, as some other companies may have done. Big Blue claims to have thought this through carefully.

“GenAI is very critical and important to our clients, but also not the only AI tool. And although there’s a lot of talk around GenAI these days, predictive AI will continue to play a critical role in enterprises,” Tzortzatos said.

“We’ll continue to serve those use cases very, very well, but GenAI opens the aperture for new use cases, such as having assistants and being able to summarize documents, being able to provide support to developers in terms of having copilots that do code autocomplete and so forth.”

These assistants include the firm’s watsonx Code Assistant for Z and watsonx Assistant for Z, for example.

A new trend that the firm sees emerging is combining both the strengths of predictive AI with the strengths of large language and code models to extract new features or new insights, and get better and more accurate results out of these AI models, Tzortzatos claimed.

She cited an example of insurance where companies are pulling the structured information relating to claims from a DB2 database, then extracting key insights such as the cause of the claim, or the urgency of it from unstructured text and feeding it into a predictive AI model to get better, more accurate results.

As detailed at Hot Chips, the Telum II processors in the z17 are eight-core chips, like the previous generation, but running at a higher 5.5 GHz clock speed. Telum II also features a 40 percent increase in cache size and another new capability – an on-chip IO accelerator or data processing unit (DPU), which is designed to offload huge volumes of data that the Spyre AI Accelerator cards will churn through while handling newer AI models.

“When it comes to large language models and GenAI, we’ve seen a factor bigger than a hundred in terms of model complexity and model size increase, and this leads to higher requirements for AI compute,” Tzortzatos explained.

Those Spyre AI Accelerator cards fit into PCIe slots, and feature up to 32 cores each, said to be a similar architecture to the AI accelerator in the Telum II chip itself. IBM says it is possible for the z17 to have up to 48 of the cards in a single system.

Big Blue is also readying z/OS 3.2, the next release of its chief operating system for IBM Z systems, which is planned for the third quarter of this year. This brings support for hardware-accelerated AI capabilities across the system and uses operational AI for system management capabilities.

The new platform will add support for modern data access methods, NoSQL databases, and hybrid cloud data processing, according to IBM, to allow AI to tap into a broader set of enterprise data from which to apply predictive business insights.

IBM is launching its new big iron at a tricky time for such big-ticket items, with the Trump administration’s approach to international trade shaking business confidence. Traditionally, the introduction of a new mainframe sees a spike in revenue for Big Blue as customers with older systems upgrade, but this year could prove a difficult sell.

However, Mike Chuba, Managing VP in Gartner’s Infrastructure and Operations group, said the company has done its homework on what customers want to see.

“If you look at the last several mainframe generation announcements and continuing with this one, IBM is spending a lot more time in its R&D process involving its big mainframe customers,” Chuba told The Register.

“IBM’s R&D efforts now focus on how the new hardware directly addresses the challenges its customers are facing. The focus on AI with the dedicated accelerator they introduced on the z16 and the turbocharged Version 2 coming with this generation directly addresses, for example, the challenge of fraud detection at the point of the transaction.”

IBM’s z17 systems will be generally available June 18, while the Spyre Accelerator cards are expected to be available in the fourth quarter. ®



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleLooking at Ethanol’s Lifecycle Emissions from ‘From Well to Wheels’
Next Article ElevenLabs, the hot AI audio startup, confirms $180M in Series C funding at a $3.3B valuation
Advanced AI Bot
  • Website

Related Posts

Will the Launch of watsonx AI Labs Be a Game Changer for IBM? – June 5, 2025

June 6, 2025

Famed Short Seller Jim Chanos Is Betting Against Used Car Retailer Carvana And AI Losers Like IBM

June 6, 2025

IBM Endicott’s Amazing Vanishing Act

June 6, 2025
Leave A Reply Cancel Reply

Latest Posts

The Timeless Willie Nelson On Positive Thinking

Jiaxing Train Station By Architect Ma Yansong Is A Model Of People-Centric, Green Urban Design

Midwestern Grotto Tradition Celebrated In Sheboygan, WI

Hugh Jackman And Sonia Friedman Boldly Bid To Democratize Theater

Latest Posts

A New Trick Could Block the Misuse of Open Source AI

June 8, 2025

C3 AI Stock Is Soaring Today: Here’s Why – C3.ai (NYSE:AI)

June 8, 2025

Nvidia Faces $8B Hit as U.S. Halts H20 AI Chip Exports to China

June 8, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

YouTube LinkedIn
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.