Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

Mankind Pharma collaborates with OpenAI to build agile supply chains, research

Kleiner Perkins-backed voice AI startup Keplar aims to replace traditional market research

Petrobras launches tenders in Brazil for IBM, Siemens and Huawei technologies

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Business AI
    • Advanced AI News Features
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
Crunchbase AI

The Wrong Way To Think About Implementing AI Agents

By Advanced AI EditorJuly 21, 2025No Comments6 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


By Sagi Eliyahu

Recently, analysts at Gartner published a bold prediction regarding the future of agentic AI in the enterprise: more than 40% of in-progress agentic AI projects will be canceled by the end of 2027.

This would seem to support other recent findings from related studies on AI agents in the enterprise. Earlier this year, for example, researchers at Carnegie Mellon University conducted an interesting-yet-flawed experiment: They staffed a fake software company, TheAgentCompany, entirely with AI agents. They asked the agents — each powered by a specific LLM — to take on the day-to-day work of a modern software company. They assigned the agents work, and that was about it so far as instruction or orchestration. After that, they asked them to get to work.

Sagi Eliyahu/Tonkean
Sagi Eliyahu of Tonkean

AI agents have been the subject of frenzied excitement in the enterprise, with such prominent CEOs as Mark Benioff, Jensen Huang, Satya Nadella and Mark Zuckerberg all predicting their impending, transformative preeminence.

CMU’s experiment, therefore, garnered lots of interest. But as outlets like Business Insider have reported, the results were not good. The best-performing agent finished just 24% of the jobs assigned to it. Most completed just 10%. It cost each agent on average $6 to complete an individual task, which added up quickly, since the jobs the agents had been assigned required completing many different tasks. Simple tasks stalled due to agents’ inability to overcome unexpected challenges, like dismissing a pop-up ad.

Observers were quick to interpret these results — along with results of still more studies conducted over the past year or so — as evidence that AI agents are perhaps not quite as capable as tech CEOs have made them out to be.

“[AI agents] are clearly not ready for more complex gigs humans excel at,” Futurism’s Joe Wilkins wrote.

Here’s how Business Insider’s Shubham Agarwal put it: “The findings, along with other emerging research about AI agents, complicate the idea that an AI agent workforce is just around the corner — there’s a lot of work they simply aren’t good at.”

Agarwal concluded the experiment was a “total disaster.”

This, however, is the incorrect conclusion to draw — incomplete at best and irrelevant at its core.

Augment, don’t replace

That’s because it stems from a flawed premise — specifically, that AI agents should be expected to replace humans outright. They’re not. They’re meant to augment them.

The agents in CMU’s experiment, in other words, were set up to fail. The culprit in the experiment was not the capacity of the agents themselves, but a misapplication of their purpose.

This, interestingly, is what underpins Gartner’s recent research into AI agents in the enterprise. According to Anushree Verma, a senior director analyst at Gartner, many in-progress AI agent deployments will fail ultimately because, “They are mostly driven by hype and are often misapplied.”

What CMU’s experiment ultimately showcases is precisely this: what happens when agentic AI rollouts stem foremost from such misapplication. It proves not that agents can’t complete complex work, but rather that CMU simply attempted to implement AI agents in entirely the wrong way.

So what’s a better way? To start, we shouldn’t treat this technology as magic.

AI agents, simply put, are tools. They’re not human replacements. They’re things for humans to use.

And just like any tool, the value humans derive from agents comes down not just to how smart or powerful individual agents are, but how strategically we leverage them to improve our own capacity.

Setting a bunch of specialized AI agents loose inside an organization without structures governing how they should work with each other or with human workers — not to mention without connecting them to the various departments, systems and policy centers, such that they can be orchestrated across them — simply isn’t very strategic.

In fact, it’s not a strategic way to leverage any tool, resource or intelligent entity, humans included. Try the same experiment, but substitute AI agents with highly intelligent human workers. Let those workers loose inside your organization without roles, responsibilities, organization or protocol, and you’ll get the same result: noisy, inefficient, expensive chaos.

LLMs can’t deliver consistently good work or work effectively together toward a common set of goals without other supporting technology or infrastructure.

So what might be more useful instead? If the goal is to determine what, ultimately, AI agents are capable of in an enterprise context, we should experiment with them using conditions consistent with an enterprise context. And we should ensure there’s adequate structure in place behind the scenes — such as end-to-end orchestration infrastructure — enabling AI agents to deliver genuine enterprise value.

Structure and strategy matter

People who believe AI agents are exciting because they’ll replace humans have it all wrong. AI agents are exciting not because they’ll replace humans, but because they’ll replace traditional enterprise software.

It’s in this way that AI agents could transform the enterprise — by improving not just the capacity of human-led organizations, but the experiences provided human workers inside them.

But only if we will it. For organizations of every sort — from TheAgentCompany to Alphabet to those surveyed by Gartner — getting transformational value out of agents will come down to one thing: how strategically we integrate them into the infrastructure of our day-to-day operations, and what sort of structures we put in place to govern them.

This is as true of AI agents as it is of any other sort of intelligent entity we leverage inside the enterprise, including humans. Intelligent entities need structure to work effectively. You want intelligent entities to be able to work autonomously and creatively in pursuit of the goals you set for them. But to effectively pursue those goals, you also need direction and hierarchy, governance and org charts, processes and rules.

It’s on what sort of structure we put around AI agents, in order to maximize their impact for humans, that we should be iterating and experimenting.

This is a matter, in the end, not only of strategy and performance, but of security; thinking carefully about how we construct and deploy AI agents in the enterprise, for example, is how we will wall off AI from things it shouldn’t be touching internally, such as login credentials, sensitive data or certain actions.

It’s also, however, the only way we’ll ever truly find out just what this technology is capable of. Anything else is a waste of time.

Sagi Eliyahu is the co-founder and CEO of Tonkean, an AI-powered intake and orchestration platform that helps enterprise-shared service teams such as procurement, legal, IT and HR create processes that people actually follow. Tonkean’s agents use AI to anticipate employees’ needs and guide them through their requests.

Related reading:


Stay up to date with recent funding rounds, acquisitions, and more with the
Crunchbase Daily.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleFunction of Beauty names new CEO
Next Article Lululemon expands to Italy with first store in Milan
Advanced AI Editor
  • Website

Related Posts

Robotics Funding Crests Higher As Figure Lands Another $1B

September 16, 2025

PwC Deal Lead On AI’s Hottest M&A Targets

September 16, 2025

WorkFusion, With Several Big Banks As Customers, Lands $45M For AI Agents ‘To Stop Bad Actors’

September 16, 2025

Comments are closed.

Latest Posts

Jennifer Packer and Marie Watt Win $250,000 Heinz Award

KAWS Named Uniqlo’s First Artist-in-Residence

Jeffrey Gibson Talks About Animals at Unveiling of New Sculptures at the Met

‘New Yorker’ Commissions High-Profile Artists for Anniversary Covers

Latest Posts

Mankind Pharma collaborates with OpenAI to build agile supply chains, research

September 17, 2025

Kleiner Perkins-backed voice AI startup Keplar aims to replace traditional market research

September 17, 2025

Petrobras launches tenders in Brazil for IBM, Siemens and Huawei technologies

September 17, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Mankind Pharma collaborates with OpenAI to build agile supply chains, research
  • Kleiner Perkins-backed voice AI startup Keplar aims to replace traditional market research
  • Petrobras launches tenders in Brazil for IBM, Siemens and Huawei technologies
  • Dreamy Collaboration! Roewe M7 DMH Globally First Equipped with Doubao Deep Thinking Large Model_the_vague
  • Cohere opens Paris office to create European hub for its AI business

Recent Comments

  1. Stevenced on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  2. BrentRounk on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  3. Stevenced on 1-800-CHAT-GPT—12 Days of OpenAI: Day 10
  4. binance on Integrate Amazon Bedrock Agents with Slack
  5. Richardsmeap on [2405.19874] Is In-Context Learning Sufficient for Instruction Following in LLMs?

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.