Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

AI Testing and Evaluation: Reflections

Grok’s AI companions drove downloads, but its latest model is the one making money

Bias Crisis in Talent Acquisition

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Industry AI
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
Crunchbase AI

The Wrong Way To Think About Implementing AI Agents

By Advanced AI EditorJuly 21, 2025No Comments6 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


By Sagi Eliyahu

Recently, analysts at Gartner published a bold prediction regarding the future of agentic AI in the enterprise: more than 40% of in-progress agentic AI projects will be canceled by the end of 2027.

This would seem to support other recent findings from related studies on AI agents in the enterprise. Earlier this year, for example, researchers at Carnegie Mellon University conducted an interesting-yet-flawed experiment: They staffed a fake software company, TheAgentCompany, entirely with AI agents. They asked the agents — each powered by a specific LLM — to take on the day-to-day work of a modern software company. They assigned the agents work, and that was about it so far as instruction or orchestration. After that, they asked them to get to work.

Sagi Eliyahu/Tonkean
Sagi Eliyahu of Tonkean

AI agents have been the subject of frenzied excitement in the enterprise, with such prominent CEOs as Mark Benioff, Jensen Huang, Satya Nadella and Mark Zuckerberg all predicting their impending, transformative preeminence.

CMU’s experiment, therefore, garnered lots of interest. But as outlets like Business Insider have reported, the results were not good. The best-performing agent finished just 24% of the jobs assigned to it. Most completed just 10%. It cost each agent on average $6 to complete an individual task, which added up quickly, since the jobs the agents had been assigned required completing many different tasks. Simple tasks stalled due to agents’ inability to overcome unexpected challenges, like dismissing a pop-up ad.

Observers were quick to interpret these results — along with results of still more studies conducted over the past year or so — as evidence that AI agents are perhaps not quite as capable as tech CEOs have made them out to be.

“[AI agents] are clearly not ready for more complex gigs humans excel at,” Futurism’s Joe Wilkins wrote.

Here’s how Business Insider’s Shubham Agarwal put it: “The findings, along with other emerging research about AI agents, complicate the idea that an AI agent workforce is just around the corner — there’s a lot of work they simply aren’t good at.”

Agarwal concluded the experiment was a “total disaster.”

This, however, is the incorrect conclusion to draw — incomplete at best and irrelevant at its core.

Augment, don’t replace

That’s because it stems from a flawed premise — specifically, that AI agents should be expected to replace humans outright. They’re not. They’re meant to augment them.

The agents in CMU’s experiment, in other words, were set up to fail. The culprit in the experiment was not the capacity of the agents themselves, but a misapplication of their purpose.

This, interestingly, is what underpins Gartner’s recent research into AI agents in the enterprise. According to Anushree Verma, a senior director analyst at Gartner, many in-progress AI agent deployments will fail ultimately because, “They are mostly driven by hype and are often misapplied.”

What CMU’s experiment ultimately showcases is precisely this: what happens when agentic AI rollouts stem foremost from such misapplication. It proves not that agents can’t complete complex work, but rather that CMU simply attempted to implement AI agents in entirely the wrong way.

So what’s a better way? To start, we shouldn’t treat this technology as magic.

AI agents, simply put, are tools. They’re not human replacements. They’re things for humans to use.

And just like any tool, the value humans derive from agents comes down not just to how smart or powerful individual agents are, but how strategically we leverage them to improve our own capacity.

Setting a bunch of specialized AI agents loose inside an organization without structures governing how they should work with each other or with human workers — not to mention without connecting them to the various departments, systems and policy centers, such that they can be orchestrated across them — simply isn’t very strategic.

In fact, it’s not a strategic way to leverage any tool, resource or intelligent entity, humans included. Try the same experiment, but substitute AI agents with highly intelligent human workers. Let those workers loose inside your organization without roles, responsibilities, organization or protocol, and you’ll get the same result: noisy, inefficient, expensive chaos.

LLMs can’t deliver consistently good work or work effectively together toward a common set of goals without other supporting technology or infrastructure.

So what might be more useful instead? If the goal is to determine what, ultimately, AI agents are capable of in an enterprise context, we should experiment with them using conditions consistent with an enterprise context. And we should ensure there’s adequate structure in place behind the scenes — such as end-to-end orchestration infrastructure — enabling AI agents to deliver genuine enterprise value.

Structure and strategy matter

People who believe AI agents are exciting because they’ll replace humans have it all wrong. AI agents are exciting not because they’ll replace humans, but because they’ll replace traditional enterprise software.

It’s in this way that AI agents could transform the enterprise — by improving not just the capacity of human-led organizations, but the experiences provided human workers inside them.

But only if we will it. For organizations of every sort — from TheAgentCompany to Alphabet to those surveyed by Gartner — getting transformational value out of agents will come down to one thing: how strategically we integrate them into the infrastructure of our day-to-day operations, and what sort of structures we put in place to govern them.

This is as true of AI agents as it is of any other sort of intelligent entity we leverage inside the enterprise, including humans. Intelligent entities need structure to work effectively. You want intelligent entities to be able to work autonomously and creatively in pursuit of the goals you set for them. But to effectively pursue those goals, you also need direction and hierarchy, governance and org charts, processes and rules.

It’s on what sort of structure we put around AI agents, in order to maximize their impact for humans, that we should be iterating and experimenting.

This is a matter, in the end, not only of strategy and performance, but of security; thinking carefully about how we construct and deploy AI agents in the enterprise, for example, is how we will wall off AI from things it shouldn’t be touching internally, such as login credentials, sensitive data or certain actions.

It’s also, however, the only way we’ll ever truly find out just what this technology is capable of. Anything else is a waste of time.

Sagi Eliyahu is the co-founder and CEO of Tonkean, an AI-powered intake and orchestration platform that helps enterprise-shared service teams such as procurement, legal, IT and HR create processes that people actually follow. Tonkean’s agents use AI to anticipate employees’ needs and guide them through their requests.

Related reading:


Stay up to date with recent funding rounds, acquisitions, and more with the
Crunchbase Daily.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleKlaviyo launches conversational AI agent
Next Article Urmilatai Karad Auditorium Inaugurated At MIT-ADT, Honouring Legacy Of Sacrifice
Advanced AI Editor
  • Website

Related Posts

Manufacturing, AI And Publishing Attract Investor Dollars

July 18, 2025

Startup M&A Crests Higher In First Half Of 2025

July 18, 2025

Lovable, A Swedish AI Vibe Coding Startup, Becomes Unicorn With $200M Series A

July 17, 2025

Comments are closed.

Latest Posts

Fine Arts Museums of San Francisco Lay Off 12 Staff

Sam Gilliam Foundation, David Kordansky Sued Over ‘Disavowed’ Painting

Donors Reportedly Pulling Support from Florida University Museum after its Controversial Transfer

What will come of the Guggenheim Asher legal battle?

Latest Posts

AI Testing and Evaluation: Reflections

July 21, 2025

Grok’s AI companions drove downloads, but its latest model is the one making money

July 21, 2025

Bias Crisis in Talent Acquisition

July 21, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • AI Testing and Evaluation: Reflections
  • Grok’s AI companions drove downloads, but its latest model is the one making money
  • Bias Crisis in Talent Acquisition
  • OpenAI jumps gun on International Math Olympiad gold medal announcement
  • Can Comet replace Google Chrome? An in-depth look at Perplexity’s new agentic AI browser | Technology News

Recent Comments

  1. fpmarkGoods on How Cursor and Claude Are Developing AI Coding Tools Together
  2. avenue17 on Local gov’t reps say they look forward to working with Thomas
  3. Lucky Star on Former Tesla AI czar Andrej Karpathy coins ‘vibe coding’: Here’s what it means
  4. микрокредит on Former Tesla AI czar Andrej Karpathy coins ‘vibe coding’: Here’s what it means
  5. www.binance.com注册 on MGX, Bpifrance, Nvidia, and Mistral AI plan 1.4GW Paris data center campus

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.