Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

Bayesian Evolutionary Swarm Architecture: A Formal Epistemic System Grounded in Truth-Based Competition

IBM to create 75 new jobs in Waterford

For Replit’s CEO, the future of software is ‘agents all the way down’

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • Amazon (Titan)
    • Anthropic (Claude 3)
    • Cohere (Command R)
    • Google DeepMind (Gemini)
    • IBM (Watsonx)
    • Inflection AI (Pi)
    • Meta (LLaMA)
    • OpenAI (GPT-4 / GPT-4o)
    • Reka AI
    • xAI (Grok)
    • Adobe Sensi
    • Aleph Alpha
    • Alibaba Cloud (Qwen)
    • Apple Core ML
    • Baidu (ERNIE)
    • ByteDance Doubao
    • C3 AI
    • DataRobot
    • DeepSeek
  • AI Research & Breakthroughs
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Education AI
    • Energy AI
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Media & Entertainment
    • Transportation AI
    • Manufacturing AI
    • Retail AI
    • Agriculture AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
Facebook X (Twitter) Instagram
Advanced AI News
Home » The new AI infrastructure reality: Bring compute to data, not data to compute
VentureBeat AI

The new AI infrastructure reality: Bring compute to data, not data to compute

Advanced AI EditorBy Advanced AI EditorJune 25, 2025No Comments6 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more

As AI transforms enterprise operations across diverse industries, critical challenges continue to surface around data storage—no matter how advanced the model, its performance hinges on the ability to access vast amounts of data quickly, securely, and reliably. Without the right data storage infrastructure, even the most powerful AI systems can be brought to a crawl by slow, fragmented, or inefficient data pipelines.

This topic took center stage on Day One of VB Transform, in a session focused on medical imaging AI innovations spearheaded by PEAK:AIO and Solidigm. Together, alongside the Medical Open Network for AI (MONAI) project—an open-source framework for developing and deploying medical imaging AI—they are redefining how data infrastructure supports real-time inference and training in hospitals, from enhancing diagnostics to powering advanced research and operational use cases.

>>See all our Transform 2025 coverage here<<

Innovating storage at the edge of clinical AI

Moderated by Michael Stewart, managing partner at M12 (Microsoft’s venture fund), the session featured insights from Roger Cummings, CEO of PEAK:AIO, and Greg Matson, head of products and marketing at Solidigm. The conversation explored how next-generation, high-capacity storage architectures are opening new doors for medical AI by delivering the speed, security and scalability needed to handle massive datasets in clinical environments.

Crucially, both companies have been deeply involved with MONAI since its early days. Developed in collaboration with King’s College London and others, MONAI is purpose-built to develop and deploy AI models in medical imaging. The open-source framework’s toolset—tailored to the unique demands of healthcare—includes libraries and tools for DICOM support, 3D image processing, and model pre-training, enabling researchers and clinicians to build high-performance models for tasks like tumor segmentation and organ classification.

A crucial design goal of MONAI was to support on-premises deployment, allowing hospitals to maintain full control over sensitive patient data while leveraging standard GPU servers for training and inference. This ties the framework’s performance closely to the data infrastructure beneath it, requiring fast, scalable storage systems to fully support the demands of real-time clinical AI. This is where Solidigm and PEAK:AIO come into play: Solidigm brings high-density flash storage to the table, while PEAK:AIO specializes in storage systems purpose-built for AI workloads.

“We were very fortunate to be working early on with King’s College in London and Professor Sebastien Orslund to develop MONAI,” Cummings explained. “Working with Orslund, we developed the underlying infrastructure that allows researchers, doctors, and biologists in the life sciences to build on top of this framework very quickly.”

Meeting dual storage demands in healthcare AI

Matson pointed out that he’s seeing a clear bifurcation in storage hardware, with different solutions optimized for specific stages of the AI data pipeline. For use cases like MONAI, similar edge AI deployments—as well as scenarios involving the feeding of training clusters—ultra-high-capacity solid-state storage plays a critical role, as these environments are often space and power-constrained, yet require local access to massive datasets.

For instance, MONAI was able to store more than two million full-body CT scans on a single node within a hospital’s existing IT infrastructure. “Very space-constrained, power-constrained, and very high-capacity storage enabled some fairly remarkable results,” Matson said. This kind of efficiency is a game-changer for edge AI in healthcare, allowing institutions to run advanced AI models on-premises without compromising performance, scalability, or data security.

In contrast, workloads involving real-time inference and active model training place very different demands on the system. These tasks require storage solutions that can deliver exceptionally high input/output operations per second (IOPS) to keep up with the data throughput needed by high-bandwidth memory (HBM) and ensure GPUs remain fully utilized. PEAK:AIO’s software-defined storage layer, combined with Solidigm’s high-performance solid-state drives (SSDs), addresses both ends of this spectrum—delivering the capacity, efficiency, and speed required across the entire AI pipeline.

A software-defined layer for clinical AI workloads at the edge

Cummings explained that PEAK:AIO’s software-defined AI storage technology, when paired with Solidigm’s high-performance SSDs, enables MONAI to read, write, and archive massive datasets at the speed clinical AI demands. This combination accelerates model training and enhances accuracy in medical imaging while operating within an open-source framework tailored to healthcare environments.

“We provide a software-defined layer that can be deployed on any commodity server, transforming it into a high-performance system for AI or HPC workloads,” Cummings said. “In edge environments, we take that same capability and scale it down to a single node, bringing inference closer to where the data lives.”

A key capability is how PEAK:AIO helps eliminate traditional memory bottlenecks by integrating memory more directly into the AI infrastructure. “We treat memory as part of the infrastructure itself—something that’s often overlooked. Our solution scales not just storage, but also the memory workspace and the metadata associated with it,” Cummings said. This makes a significant difference for customers who can’t afford—either in terms of space or cost—to re-run large models repeatedly. By keeping memory-resident tokens alive and accessible, PEAK:AIO enables efficient, localized inference without needing constant recomputation.

Bringing intelligence closer to the data

Cummings emphasized that enterprises will need to take a more strategic approach to managing AI workloads. “You can’t be just a destination. You have to understand the workloads. We do some incredible technology with Solidign and their infrastructure to be smarter on how that data is processed, starting with how to get performance out of a single node,” Cummings explained. “So with inference being such a large push, we’re seeing generalists becoming more specialized. And we’re now taking work that we’ve done from a single node and pushing it closer to the data to be more efficient. We want more intelligent data, right? The only way to do that is to get closer to that data.”

Some clear trends are emerging from large-scale AI deployments, particularly in newly built greenfield data centers. These facilities are designed with highly specialized hardware architectures that bring data as close as possible to the GPUs. To achieve this, they rely heavily on all solid-state storage—specifically ultra-high-capacity SSDs—designed to deliver petabyte-scale storage with the speed and accessibility needed to keep GPUs continuously fed with data at high throughput.

“Now that same technology is basically happening at a microcosm, at the edge, in the enterprise,” Cumming explained. “So it’s becoming critical to purchasers of AI systems to determine how you select your hardware and system vendor, even to make sure that if you want to get the most performance out of your system, that you’re running on all solid-state. This allows you to bring huge amounts of data, like the MONAI example—it was 15,000,000 plus images, in a single system. This enables incredible processing power, right there in a small system at the end.”

Daily insights on business use cases with VB Daily

If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.

Read our Privacy Policy

Thanks for subscribing. Check out more VB newsletters here.

An error occured.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleCreative Commons debuts CC signals, a framework for an open AI ecosystem
Next Article Lawsuit: MIT professor harassed Israeli researcher, Jewish student as president stood by
Advanced AI Editor
  • Website

Related Posts

For Replit’s CEO, the future of software is ‘agents all the way down’

June 25, 2025

IBM sees enterprise customers are using ‘everything’ when it comes to AI, the challenge is matching the LLM to the right use case

June 25, 2025

Forget about AI costs: Google just changed the game with open-source Gemini CLI that will be free for most developers

June 25, 2025
Leave A Reply Cancel Reply

Latest Posts

Ezrom Legae And Art Under Apartheid At High Museum Of Art In Atlanta

Chanel Launches Arts & Culture Magazine

Publicity Wizard Jalila Singerff On The Vital PR Rules For 2025

Tourist Damaged 17th-Century Portrait at Florence’s Uffizi Galleries

Latest Posts

Bayesian Evolutionary Swarm Architecture: A Formal Epistemic System Grounded in Truth-Based Competition

June 25, 2025

IBM to create 75 new jobs in Waterford

June 25, 2025

For Replit’s CEO, the future of software is ‘agents all the way down’

June 25, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

YouTube LinkedIn
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.