Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

Bloomberg Finds AI Data Centers Fueling America’s Energy Bill Crisis

Revisiting Valuation After Disappointing Results, Withdrawn Guidance, and Leadership Shift

HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video – Takara TLDR

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Business AI
    • Advanced AI News Features
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
Industry Applications

AI’s Big Data Step Function

By Advanced AI EditorOctober 8, 2025No Comments5 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Conventional wisdom holds that you need to fix your data management shortcomings before succeeding with AI. But that may no longer be true, according to some tech execs, who see the potential to apply generative AI’s capability to grasp language and fix data management issues at the same time you’re building AI apps.

Rahul Pathak, the vice president of data and AI go to market at AWS, considers himself an old school data guy, the kind who would never recommend taking shortcuts in order to show success on paper. So when he suggests that GenAI might allow you jump ahead in your data management capabilities and get results faster, you might want to take notice.

“We were in a world where you would have to serialize your way through this, where you would have to get the data house in order, then you would have to build the app that sits on top of the data,” Pathak says. “I think you can actually change this process a little bit, where you can start to unlock your data for AI almost immediately, using well-governed, secure MCP endpoints and state of the art models. [They] can really help you unlock that data almost in place that can then start to help you light up the AI applications.”

Not all AI use cases are the same, obviously. Some use cases may require data to be collected, cleansed, and prepared before it touches an AI algorithm. But when it comes to running inference workloads on pre-trained models, staging the data may not be a possibility, in which case a federated approach would be in order. The good news is that Model Context Protocol (MCP) covers up a lot of data sins that may previously have required a considerable amount of atonement (not to mention data management pain and dollars).

“I’m somewhat of an old school data person at this point, but you can think of the MCP server as almost like a federated query,” Pathak said. “The model lets you get the data. It’s somewhat schema resilient. And then the knowledge base and the index is almost like a materialized view. And so in that combination, you can get the data much faster. And the intelligence in the models does augment the capabilities of the data engineer and data scientist in a way that really allows them to move much faster than we could before.”

Pathak had a real-world example in the form of a manufacturing company that wanted to use generative AI to speed up production. The company had reams of telemetry data already collected, but it was proving to be difficult and time-consuming to extract the knowledge out of that telemetry data to apply it to the factory line.

The solution was to use the natural language processing (NLP) capabilities of GenAI to extract the pertinent pieces of data out of the telemetry data. Those insights were then fed into traditional machine learning optimization models. On the back end, GenAI was used again to generate the instructions that told the operators how to modify their process to speed up production.

(IM Imagery/Shutterstock)

“It’s that kind of integration that allows us to move much faster than we could before,” Pathak said. “Because otherwise you’d have a big data and ETL and kind of data munging project that you’d have to do to get that telemetry off and usable quickly. And we can do that much, much faster now. So that’s a big unlock.”

Another proponent of skipping the big data management project and jumping straight into GenAI projects is PromptQL. The company developed a GenAI-based query tool that allows users to begin querying their data immediately, without going through the time-consuming process of building a semantic layer.

A semantic layer is still important, the folks at PromptQL say, because it serves to translate a business’ specific terms and metrics into the technical table names that the tool needs to serve accurate queries. But the big difference is that PromptQL advocates building the semantic layer as you go, and customizing it over time thanks to feedback from users. Spending months or years on a big-bang data management project is a path to endless POCs and ultimately failure, they say.

The high failure rate of early AI projects is the elephant in the room. The recent MIT study that found 95% of GenAI projects never get out of the trial stage has people on edge. With trillions of dollars being invested in acquiring speedy GPUs, massive storage arrays, and huge AI data centers, some very wealthy institutions are placing some big bets on AI.

Smaller companies with fewer resources have to be much smarter about how they attack the AI opportunity. The good news is that GenAI’s capability to grasp language can be employed in a multitude of ways, including using it to understand how data is modeled, which can potentially allow you to, if not skip the data management stage, at least tackle it at the same time that you’re building your first AI project.

“These aren’t sequential steps anymore,” Pathak says. “I think that’s a big paradigm shift for a lot of companies that are dealing with legacy data challenges, which, frankly, we’ve been dealing with since we had more than one table in the database.”

“I think what generative AI has done and what’s different now,” he says, “is that it’s really given us some superpowers to achieve these things.”

This article first appeared on our sister publication, HPCwire.

Related

About the author: Alex Woodie

Alex Woodie has written about IT as a technology journalist for more than a decade. He brings extensive experience from the IBM midrange marketplace, including topics such as servers, ERP applications, programming, databases, security, high availability, storage, business intelligence, cloud, and mobile enablement. He resides in the San Diego area.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleRefusal Falls off a Cliff: How Safety Alignment Fails in Reasoning? – Takara TLDR
Next Article OpenAI’s Nick Turley on transforming ChatGPT into an operating system
Advanced AI Editor
  • Website

Related Posts

Bloomberg Finds AI Data Centers Fueling America’s Energy Bill Crisis

October 8, 2025

Tesla will let you bring back this removed Model 3 part for a price

October 8, 2025

Nexl Bags $23m, Will Invest In Hires + Acquisitions – Artificial Lawyer

October 8, 2025

Comments are closed.

Latest Posts

Matthiesen Gallery Files Lawsuit Over Gustave Courbet Painting

MoMA Partners with Mattel for Van Gogh Barbie, Monet and Dalí Figures

Underground Film Legend and Artist Dies at 92

Artwork Forfeited by Inigo Philbrick’s Partner Flops at Sotheby’s

Latest Posts

Bloomberg Finds AI Data Centers Fueling America’s Energy Bill Crisis

October 8, 2025

Revisiting Valuation After Disappointing Results, Withdrawn Guidance, and Leadership Shift

October 8, 2025

HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video – Takara TLDR

October 8, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Bloomberg Finds AI Data Centers Fueling America’s Energy Bill Crisis
  • Revisiting Valuation After Disappointing Results, Withdrawn Guidance, and Leadership Shift
  • HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video – Takara TLDR
  • Facing Backlash, OpenAI Curbs Use of Dead Celebrities’ Likenesses in Sora
  • To scale agentic AI, Notion tore down its tech stack and started fresh

Recent Comments

  1. real tiktok views on VAST Data Powers Smarter, Evolving AI Agents with NVIDIA Data Flywheel
  2. Janene Hillock on Class Dismissed? Representative Claims in Getty v. Stability AI | Cooley LLP
  3. Bryanassit on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  4. EnriqueGen on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  5. Bryanassit on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.