Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

Australia’s biggest bank cut staff for AI, then it backtracked – and it’s one of many scrapping plans for automated customer support teams

The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward – Takara TLDR

Bulgarian Doctoral Student Anna-Maria Halacheva Recognized by European Commission – Novinite.com

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Business AI
    • Advanced AI News Features
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
AI Search

Azure AI Search Unveils Agentic Retrieval for Smarter Conversational AI

By Advanced AI EditorMay 31, 2025No Comments3 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Microsoft has launched the public preview of agentic retrieval in Azure AI Search, a query engine that autonomously plans and executes retrieval strategies for complex questions. According to the company, it enhances answer relevance in conversational AI by up to 40% compared to traditional RAG. This multi-turn system leverages conversation history and Azure OpenAI to break down queries into focused subqueries, executed in parallel across text and vector embeddings.

This new capability is supported programmatically through a new Knowledge Agents object in the 2025-05-01-preview data plane REST API and Azure SDK prerelease packages. It builds on Azure AI Search’s existing index, a dedicated “Agent” resource that links to Azure OpenAI, and the retrieval engine orchestrating the process. Microsoft positions agentic retrieval as a crucial step toward building more sophisticated knowledge retrieval systems, explicitly designed for intelligent agents, and provides high-quality grounding data for downstream consumption.

According to the documentation, the agentic retrieval process involves the following stages: First, an LLM analyzes the entire chat thread to identify the core information. Subsequently, it plans a retrieval strategy that incorporates the chat history and the original query. Next, each subquery runs simultaneously, leveraging both keyword and semantic search capabilities of Azure AI Search. In a Microsoft Build session, Matthew Gotteiner explained:

It’s important to note that the overall speed of agentic retrieval is directly related to the number of subqueries generated. While running subqueries in parallel aims to accelerate the process, a more complex query requiring numerous subqueries will naturally take longer to complete. Counterintuitively, a “mini” query planner that generates fewer, broader subqueries might return results faster than a “full-size” planner designed to create a larger number of highly focused subqueries.

The results are reranked using the platform’s semantic ranker into a unified grounding payload with top hits and structured metadata. And finally, the API also returns a detailed activity log of the retrieval process.

(Source: Microsoft Tech community blog post)

Akshay Kokane, a Software Engineer at Microsoft, concluded in a Medium blog post:

Traditional RAG systems are a great starting point for enhancing LLMs with domain-specific knowledge — especially when using tools like Semantic Kernel and Azure AI Search, which simplify embedding and retrieval. However, as enterprise use cases become more complex, the limitations of static, linear workflows become apparent.


Agentic RAG (ARAG) addresses this gap by introducing dynamic reasoning, intelligent tool selection, and iterative refinement. Agents can adapt their search strategies, evaluate results, and construct more precise, context-aware answers — making them ideal for evolving business needs, compliance workflows, or multi-source data environments.

Lastly, the public preview is currently available in select regions, and the agentic retrieval pricing includes per-token billing for Azure OpenAI’s query planning and Azure AI Search’s semantic ranking, both of which are free during the initial preview. Documentation, a cookbook, and integration guidance with Azure AI Agent Service are available for developers.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleReal TikTokers are pretending to be Veo 3 AI creations for fun, attention
Next Article Scott Aaronson: Quantum Computing | Lex Fridman Podcast #72
Advanced AI Editor
  • Website

Related Posts

AI search optimization? GEO? SEOs can’t agree on a name: Survey

September 11, 2025

Explaining Google’s AI Search Experiments To Your C-Suite

September 11, 2025

Google’s AI is the ‘worst’ for stealing content, says People CEO

September 11, 2025
Leave A Reply

Latest Posts

Long-Lost Painting By Rubens From 1613 Discovered in Paris Mansion

Ken Griffin Loves Pollock’s Blue Poles So Much He Tried to Buy it

Sally Mann Says Her Black Men Photos Are ‘Problematic’ in Hindsight

NeueHouse, a Hot Spot for Art Events, Files for Bankruptcy

Latest Posts

Australia’s biggest bank cut staff for AI, then it backtracked – and it’s one of many scrapping plans for automated customer support teams

September 12, 2025

The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward – Takara TLDR

September 12, 2025

Bulgarian Doctoral Student Anna-Maria Halacheva Recognized by European Commission – Novinite.com

September 12, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Australia’s biggest bank cut staff for AI, then it backtracked – and it’s one of many scrapping plans for automated customer support teams
  • The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward – Takara TLDR
  • Bulgarian Doctoral Student Anna-Maria Halacheva Recognized by European Commission – Novinite.com
  • MIT CSAIL’s drone system embraces uncertainty
  • TikTok parent Bytedance launches new AI tool Seedream 4.0 to rival Google’s Nano Banana

Recent Comments

  1. whackysalamander8Nalay on Foundation AI: Cisco launches AI model for integration in security applications
  2. fluffyglowcrab9Nalay on Foundation AI: Cisco launches AI model for integration in security applications
  3. リアルラブドール on 24 Hour Ticket Offer – Legal Innovators California – June 11 + 12 – Artificial Lawyer
  4. zippyglowworm5Nalay on Foundation AI: Cisco launches AI model for integration in security applications
  5. WalterPew on Trump’s Tech Sanctions To Empower China, Betray America

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.