Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

EU Commission: “AI Gigafactories” to strengthen Europe as a business location

United States, China, and United Kingdom Lead the Global AI Ranking According to Stanford HAI’s Global AI Vibrancy Tool

Foundation AI: Cisco launches AI model for integration in security applications

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • Adobe Sensi
    • Aleph Alpha
    • Alibaba Cloud (Qwen)
    • Amazon AWS AI
    • Anthropic (Claude)
    • Apple Core ML
    • Baidu (ERNIE)
    • ByteDance Doubao
    • C3 AI
    • Cohere
    • DataRobot
    • DeepSeek
  • AI Research & Breakthroughs
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Education AI
    • Energy AI
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Media & Entertainment
    • Transportation AI
    • Manufacturing AI
    • Retail AI
    • Agriculture AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
Advanced AI News
Home » Google Cloud Preps for Agentic AI Era with ‘Ironwood’ TPU, New Models and Software
Industry Applications

Google Cloud Preps for Agentic AI Era with ‘Ironwood’ TPU, New Models and Software

Advanced AI BotBy Advanced AI BotApril 11, 2025No Comments7 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Google Cloud Ironwood TPU on a rack. (Source: Google Cloud)

Google Cloud is gearing up for the agentic AI era in a big way, and its showing off its new wares this week at its NEXT conference. The company unveiled a slew of new AI models and new software for developing and managing AI agents, as well as the seventh generation of the processor at the heart of its AI Hypercomputer, a TPU dubbed Ironwood, which Google says is twice as power efficient as the previous generation.

Google Cloud is seeing AI workloads shifting from model training to inference workloads, which is a trend that Nvidia also observed during its recent GTC conference. The seventh-gen Ironwood TPU was built from the ground up for inferencing at scale, according to Amin Vahdat, the company’s vice president of ML, systems and cloud AI. And oh my, what scale.

“Ironwood will scale to over 9,000 chips per pod to meet the exponentially growing demands of thinking models like Gemini 2.5,” Vahdat said during a press conference on Monday. “This scale will deliver a staggering 42.5 exaflops of compute per pod.”

For perspective, the world’s number one supercomputer, El Capitan, supports 1.7 Exaflops per pod, Vahdat said. By comparison, Ironwood running on Google Cloud’s TPU-based AI Hypercomputer will deliver more than 24 times the compute power of El Capitan, he pointed out.

Much of that compute power will go toward serving the burgeoning demand for AI workloads, he said. “We’ve seen a 10x year over year increase in demand for training and serving models,” Vahdat continued. “Innovations throughout the TPU architecture, such as liquid cooling and optical switching, have led to 100 times improvements in sustained performance relative to conventional architecture design.”

Google Cloud has made a few other improvements to its service to help customers put all that power to use. For instance, its making its internal advanced networking technology to, dubbed Google Cloud WAN, available to customers for the first time.

“Our customers can now tap into the same planet scale network that powers Google’s globally available services, including Gmail, YouTube, and search,” Vahdat said. “No other technology company can offer this to its customers.”

Google Cloud’s seventh-generation TPU, Ironwood. (Source: Google Cloud)

It also is making its own internal machine learning runtime, dubbed Next Pathways, available to customers. “Developed by Google DeepMind, Pathways on Google Cloud allows customers to scale out model serving to hundreds of TPUs with exceptional performance,” Vahdat said.

Google develops one of the world’s most capable foundation models, Gemini 2.5 Pro. The reasoning model, which is available through its Vertex AI service, is capable of breaking up complex problems and using multi-stepped thought processes to deliver accurate answers in demanding environments, such as drug discovery, financial modeling, and risk management, Vahdat said.

Soon Google Cloud customers will have a more affordable version of that model, dubbed Gemini 2.5 Flash. “Gemini 2.5 Flash is more affordable for everyday use cases,” Vahdat said. “The model gives cloud customers the ability for fast responses and high volume customer interactions. It can quickly generate real time summaries of documents or news, and can assist with basic coding tasks and function calling where responsiveness is important.”

Reasoning models such as Gemini 2.5 Flash will be widely used for AI agents, which are rapidly progressing in capability and usability. Google Cloud is using its NEXT conference to roll out a slew of additional software to help customers develop and manage their new robot workers.

For starters, Google Cloud is rolling out a new Agent Development Kit (ADK), which it bills as a “unified development environment” that “makes it easy to build, test and operate these agents,” Vahdat said.

“With ADK, customers can easily build a multi-agent system in under 100 lines of code and precisely steer agent behavior with creative reasoning and strict guardrails,” the Google VP said. “Customers can go from concept to testing, with real data and assets, to running with security and compliance in production in less than a week.”

Ironwood’s FLOPS per watt. (Source: Google Cloud)

Since growing new crops of AI agents will be so important, why not have a garden devoted to it? That’s essentially what Google Cloud is enabling with its aptly named Agent Garden, which Vahdat called a collection of ready to use samples and tools directly accessible in SDK. The Agent Garden will make it easy for users to connect agents to 100 plus pre-built connectors, as well as to custom APIs, other integration workflows, or data stored in customers cloud systems. It will also support Model Context Protocol (MCP), the new protocol developed by Anthropic to connect data with models

Google Cloud is supporting MCP, which appears to have the early lead in the search for industry standard protocols. But there’s also room for an Agent to Agent protocol, which is something that Google Cloud just announced. A2A, as it’s called, will be geared at enabling agents to call and connect to other agents, as opposed to AI models and tools, which is the focus with MCP, Vehdat said.

But wait, there’s more agentic AI from Google Cloud! The company is rolling out an AI Agent Marketplace where customers can search for and select from a slew of partner-developed AI agents to use in their Google Cloud environment. And Google Cloud is also launching Google Agent Space, which is designed to provide organizations a clearinghouse of sorts to share information about AI agents to employees.

Google Cloud also provides a slew of AI agents to handle a range of data engineering, data science, and data analytics tasks. It is using Google Cloud Next to unveil enhancements to these agents, too.

The company is launching a handful for new specialized data agents for data engineering and data science at NEXT, according to Brad Calder, vice president and GM of Google Cloud. Its adding agents directly into BigQuery pipelines to build data pipelines. It’s also adding agents to perform data prep tasks, such as transformation and enrichment, and another specifically for anomaly detection.

Google Cloud’s Agent Engine functioning in AgentSpace. (Source: Google Cloud)

“We deliver agents for all aspects of the data engineering lifecycle, from catalog automation metadata generation to maintaining data quality to data pipeline generation,” Calder said during the press conference.

Data scientists will appreciate the new agent in Google’s Colab notebook, which will help with a range of tasks, including feature engineering, model selection, and training and iteration. Data security is also a focus for Google Cloud’s agentic development, and to that end, it is launching new two data engineering agents, one that analyzes security threats and another that analyzes malware.

Finally, Google Cloud is rolling out its new Gemini Code Assist Kanban board, which provides a real time display of the tasks that Google AI agents are working on, and also gives them the ability to interact with the agents.

Google Cloud has a ton more news at the show (the book of blogs it shared with reporters was nearly 200 pages). Keep BigDATAwire bookmarked for the most relevant bits.

This article first appeared on BigDATAwire.

Related

About the author: Alex Woodie

Alex Woodie has written about IT as a technology journalist for more than a decade. He brings extensive experience from the IBM midrange marketplace, including topics such as servers, ERP applications, programming, databases, security, high availability, storage, business intelligence, cloud, and mobile enablement. He resides in the San Diego area.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleC3 AI and Arcfield Announce Partnership to Accelerate AI Capabilities to Serve U.S. Defense and Intelligence Communities
Next Article Former Google CEO suggests building data centers in remote locations in case of nation-state attacks to slow down AI
Advanced AI Bot
  • Website

Related Posts

How SandboxAQ and Stand Up To Cancer Are Using AI to Transform Cancer Research

June 6, 2025

Winklevoss twins’ crypto firm Gemini confidentially files for IPO

June 6, 2025

Tesla launches new feature that cold climate drivers will love

June 6, 2025
Leave A Reply Cancel Reply

Latest Posts

Original Prototype for Jane Birkin’s Hermes Bag Consigned to Sotheby’s

Viral Trump Vs. Musk Feud Ignites A Meme Chain Reaction

UK Art Dealer Sentenced To 2.5 Years In Jail For Selling Art to Suspected Hezbollah Financier

Artists Accuse Dealer Reco Sturgis of Withholding Payments and Artworks

Latest Posts

EU Commission: “AI Gigafactories” to strengthen Europe as a business location

June 7, 2025

United States, China, and United Kingdom Lead the Global AI Ranking According to Stanford HAI’s Global AI Vibrancy Tool

June 7, 2025

Foundation AI: Cisco launches AI model for integration in security applications

June 7, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

YouTube LinkedIn
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.