Advanced AI News

IBM Cloud is First Service Provider to Deploy Intel Gaudi 3

By Advanced AI Editor | May 1, 2025


IBM is the first cloud service provider to make Intel® Gaudi® 3 AI accelerators available to customers, a move designed to make powerful artificial intelligence capabilities more accessible and to directly address the high cost of specialized AI hardware.

For Intel, the rollout on IBM Cloud marks the first major commercial deployment of Gaudi 3, bringing choice to the market. By leveraging Intel Gaudi 3 on IBM Cloud, the two companies aim to help clients cost-effectively test, innovate and deploy GenAI solutions.

According to a recent forecast by research firm Gartner, worldwide generative AI (GenAI) spending is expected to total $644 billion in 2025, an increase of 76.4% from 2024. The research found “GenAI will have a transformative impact across all aspects of IT spending markets, suggesting a future where AI technologies become increasingly integral to business operations and consumer products.”

For many enterprise customers, the benefits are clear when tools like GenAI automate tasks, improve workflows and drive innovation. But deploying AI applications demands significant computing power, often requiring expensive specialized processors that can keep many businesses from benefiting from AI.

Gaudi 3 AI accelerators are specifically designed to help meet the surging demand for GenAI, large-model inferencing and model fine-tuning while supporting an open development framework. Gaudi 3 is also well suited to multimodal large language models (LLMs) and retrieval-augmented generation (RAG).

“By bringing Intel Gaudi 3 AI accelerators to IBM Cloud, we’re enabling businesses to help scale generative AI workloads with optimized performance for inferencing and fine-tuning,” said Saurabh Kulkarni, vice president of Data Center AI Strategy at Intel. “This collaboration underscores our shared commitment to making AI more accessible and cost-effective for enterprises worldwide.”

How Enterprise Customers Use IBM Cloud

IBM Cloud serves a range of enterprise customers, particularly those in regulated industries, such as financial services, healthcare and life sciences, and the public sector.

Banks and insurance companies use the cloud for fraud detection or personalized customer service, while healthcare providers use it for accelerating drug discovery and development, AI-driven diagnostics, telemedicine platforms and real-time patient monitoring. Retailers use cloud technology for e-commerce platforms or inventory management. It’s also a go-to for companies looking to modernize old systems without giving up control or security.

Gaudi 3 is now available in the IBM Cloud regions of Frankfurt, Germany; Washington, D.C.; and Dallas, Texas.

Gaudi 3 is also being integrated into IBM’s broader AI infrastructure offerings. Customers can use Gaudi 3 via IBM Cloud Virtual Servers on the IBM Virtual Private Cloud (VPC) now. Customers will also be able to deploy across architectures starting in the second half of 2025. Support for Red Hat OpenShift and IBM’s watsonx AI platform is expected to be available this quarter.

“The ability to handle more data, and have higher performance, all of this is going to drive better adoption of AI for customers worldwide,” said Satinder Sethi, general manager of IBM Cloud Infrastructure Services. “Intel Gaudi 3 is giving customers more choice, more freedom and a more cost-effective platform of which AI hardware they want to use.”

Cost and Performance Comparisons

Intel Gaudi 3 AI accelerators are designed to tackle the cost challenge by balancing performance and price. New AI inferencing benchmark tests conducted by research firm Signal65, and commissioned by Intel, found that Gaudi 3 is 92% more cost-efficient (performance per dollar) than the competition when running Meta’s Llama-3.1-405B-Instruct-FP8 model with large context sizes [1].

Cost efficiency is a crucial metric because it allows businesses to do more AI processing for the same investment or the same amount of processing at a lower cost. Performance gains are intended to lower the cost barrier for companies looking to deploy or fine-tune models, particularly as GenAI adoption spreads.
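As a rough illustration of how a performance-per-dollar comparison can be computed, the sketch below divides throughput by hourly price and expresses the difference as a percentage. The function names and every number in it are hypothetical and chosen only to make the arithmetic easy to follow; they are not Signal65's methodology, measurements, or any vendor's pricing.

```python
def tokens_per_dollar(tokens_per_second: float, dollars_per_hour: float) -> float:
    """Tokens produced for each dollar spent, given throughput and hourly price."""
    tokens_per_hour = tokens_per_second * 3600
    return tokens_per_hour / dollars_per_hour


def percent_gain(candidate: float, baseline: float) -> float:
    """How much larger `candidate` is than `baseline`, as a percentage."""
    return (candidate / baseline - 1) * 100


# Hypothetical figures, for illustration only.
baseline = tokens_per_dollar(tokens_per_second=1000, dollars_per_hour=10)
candidate = tokens_per_dollar(tokens_per_second=1200, dollars_per_hour=8)

print(f"baseline:  {baseline:,.0f} tokens per dollar")
print(f"candidate: {candidate:,.0f} tokens per dollar")
print(f"gain:      {percent_gain(candidate, baseline):.0f}% more cost-efficient")
```

In this made-up case the candidate is both faster and cheaper per hour, so the two effects compound. A cost-efficiency figure can also come entirely from a lower price at equal throughput, which is why it is reported separately from raw performance.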

Throughput, or performance, refers to the amount of AI processing the accelerator can perform in a given amount of time, commonly measured in tokens per second. In the same tests, Gaudi 3 delivered higher throughput than the competition. On the IBM Granite-3.1-8B-Instruct model, Gaudi 3 provided 43% more tokens per second for small AI workloads [1]. When running Meta’s Llama-3.1-405B-Instruct-FP8 model with large context sizes, it provided 36% more tokens per second [1].
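To make the tokens-per-second metric concrete, here is a minimal sketch of how throughput and a "percent more tokens per second" figure relate. The helper names and the sample token counts and timings are assumptions made for illustration; the article does not give the underlying measurements.

```python
def tokens_per_second(num_tokens: int, elapsed_seconds: float) -> float:
    """Throughput: tokens generated divided by wall-clock time."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return num_tokens / elapsed_seconds


def percent_more(candidate_tps: float, baseline_tps: float) -> float:
    """Relative throughput advantage of the candidate, as a percentage."""
    return (candidate_tps / baseline_tps - 1) * 100


# Hypothetical runs: both systems are timed over the same 10-second window.
baseline_tps = tokens_per_second(num_tokens=10_000, elapsed_seconds=10.0)
candidate_tps = tokens_per_second(num_tokens=14_300, elapsed_seconds=10.0)

print(f"baseline:  {baseline_tps:.0f} tokens/s")
print(f"candidate: {candidate_tps:.0f} tokens/s")
print(f"advantage: {percent_more(candidate_tps, baseline_tps):.0f}% more tokens per second")
```

A percentage like this only compares two systems on one model and one workload shape, so figures for small workloads and for large context sizes are not interchangeable.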

More: IBM Empowers Enterprises to Scale AI (Intel.com) | Intel and IBM Announce the Availability of Intel Gaudi 3 AI Accelerators on IBM Cloud (IBM)


