Close Menu
  • Home
  • AI Models
    • DeepSeek
    • xAI
    • OpenAI
    • Meta AI Llama
    • Google DeepMind
    • Amazon AWS AI
    • Microsoft AI
    • Anthropic (Claude)
    • NVIDIA AI
    • IBM WatsonX Granite 3.1
    • Adobe Sensi
    • Hugging Face
    • Alibaba Cloud (Qwen)
    • Baidu (ERNIE)
    • C3 AI
    • DataRobot
    • Mistral AI
    • Moonshot AI (Kimi)
    • Google Gemma
    • xAI
    • Stability AI
    • H20.ai
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Microsoft Research
    • Meta AI Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding & Startups
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • Expert Insights & Videos
    • Google DeepMind
    • Lex Fridman
    • Matt Wolfe AI
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • Matt Wolfe AI
    • The TechLead
    • Andrew Ng
    • OpenAI
  • Expert Blogs
    • François Chollet
    • Gary Marcus
    • IBM
    • Jack Clark
    • Jeremy Howard
    • Melanie Mitchell
    • Andrew Ng
    • Andrej Karpathy
    • Sebastian Ruder
    • Rachel Thomas
    • IBM
  • AI Policy & Ethics
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
    • EFF AI
    • European Commission AI
    • Partnership on AI
    • Stanford HAI Policy
    • Mozilla Foundation AI
    • Future of Life Institute
    • Center for AI Safety
    • World Economic Forum AI
  • AI Tools & Product Releases
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
    • Image Generation
    • Video Generation
    • Writing Tools
    • AI for Recruitment
    • Voice/Audio Generation
  • Industry Applications
    • Finance AI
    • Healthcare AI
    • Legal AI
    • Manufacturing AI
    • Media & Entertainment
    • Transportation AI
    • Education AI
    • Retail AI
    • Agriculture AI
    • Energy AI
  • AI Art & Entertainment
    • AI Art News Blog
    • Artvy Blog » AI Art Blog
    • Weird Wonderful AI Art Blog
    • The Chainsaw » AI Art
    • Artvy Blog » AI Art Blog
What's Hot

Perplexity Plans to Bring Comet AI Browser to Smartphones

Hey you, AI algorithm! Explain yourself!

IBM launches global entrance test for MBA, MCA, MSc admissions | Bengaluru News

Facebook X (Twitter) Instagram
Advanced AI News
  • Home
  • AI Models
    • OpenAI (GPT-4 / GPT-4o)
    • Anthropic (Claude 3)
    • Google DeepMind (Gemini)
    • Meta (LLaMA)
    • Cohere (Command R)
    • Amazon (Titan)
    • IBM (Watsonx)
    • Inflection AI (Pi)
  • AI Research
    • Allen Institue for AI
    • arXiv AI
    • Berkeley AI Research
    • CMU AI
    • Google Research
    • Meta AI Research
    • Microsoft Research
    • OpenAI Research
    • Stanford HAI
    • MIT CSAIL
    • Harvard AI
  • AI Funding
    • AI Funding Database
    • CBInsights AI
    • Crunchbase AI
    • Data Robot Blog
    • TechCrunch AI
    • VentureBeat AI
    • The Information AI
    • Sifted AI
    • WIRED AI
    • Fortune AI
    • PitchBook
    • TechRepublic
    • SiliconANGLE – Big Data
    • MIT News
    • Data Robot Blog
  • AI Experts
    • Google DeepMind
    • Lex Fridman
    • Meta AI Llama
    • Yannic Kilcher
    • Two Minute Papers
    • AI Explained
    • TheAIEdge
    • The TechLead
    • Matt Wolfe AI
    • Andrew Ng
    • OpenAI
    • Expert Blogs
      • François Chollet
      • Gary Marcus
      • IBM
      • Jack Clark
      • Jeremy Howard
      • Melanie Mitchell
      • Andrew Ng
      • Andrej Karpathy
      • Sebastian Ruder
      • Rachel Thomas
      • IBM
  • AI Tools
    • AI Assistants
    • AI for Recruitment
    • AI Search
    • Coding Assistants
    • Customer Service AI
  • AI Policy
    • ACLU AI
    • AI Now Institute
    • Center for AI Safety
  • Industry AI
    • Finance AI
    • Healthcare AI
    • Education AI
    • Energy AI
    • Legal AI
LinkedIn Instagram YouTube Threads X (Twitter)
Advanced AI News
AI Search

Azure AI Search Unveils Agentic Retrieval for Smarter Conversational AI

By Advanced AI EditorMay 31, 2025No Comments3 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Microsoft has launched the public preview of agentic retrieval in Azure AI Search, a query engine that autonomously plans and executes retrieval strategies for complex questions. According to the company, it enhances answer relevance in conversational AI by up to 40% compared to traditional RAG. This multi-turn system leverages conversation history and Azure OpenAI to break down queries into focused subqueries, executed in parallel across text and vector embeddings.

This new capability is supported programmatically through a new Knowledge Agents object in the 2025-05-01-preview data plane REST API and Azure SDK prerelease packages. It builds on Azure AI Search’s existing index, a dedicated “Agent” resource that links to Azure OpenAI, and the retrieval engine orchestrating the process. Microsoft positions agentic retrieval as a crucial step toward building more sophisticated knowledge retrieval systems, explicitly designed for intelligent agents, and provides high-quality grounding data for downstream consumption.

According to the documentation, the agentic retrieval process involves the following stages: First, an LLM analyzes the entire chat thread to identify the core information. Subsequently, it plans a retrieval strategy that incorporates the chat history and the original query. Next, each subquery runs simultaneously, leveraging both keyword and semantic search capabilities of Azure AI Search. In a Microsoft Build session, Matthew Gotteiner explained:

It’s important to note that the overall speed of agentic retrieval is directly related to the number of subqueries generated. While running subqueries in parallel aims to accelerate the process, a more complex query requiring numerous subqueries will naturally take longer to complete. Counterintuitively, a “mini” query planner that generates fewer, broader subqueries might return results faster than a “full-size” planner designed to create a larger number of highly focused subqueries.

The results are reranked using the platform’s semantic ranker into a unified grounding payload with top hits and structured metadata. And finally, the API also returns a detailed activity log of the retrieval process.

(Source: Microsoft Tech community blog post)

Akshay Kokane, a Software Engineer at Microsoft, concluded in a Medium blog post:

Traditional RAG systems are a great starting point for enhancing LLMs with domain-specific knowledge — especially when using tools like Semantic Kernel and Azure AI Search, which simplify embedding and retrieval. However, as enterprise use cases become more complex, the limitations of static, linear workflows become apparent.


Agentic RAG (ARAG) addresses this gap by introducing dynamic reasoning, intelligent tool selection, and iterative refinement. Agents can adapt their search strategies, evaluate results, and construct more precise, context-aware answers — making them ideal for evolving business needs, compliance workflows, or multi-source data environments.

Lastly, the public preview is currently available in select regions, and the agentic retrieval pricing includes per-token billing for Azure OpenAI’s query planning and Azure AI Search’s semantic ranking, both of which are free during the initial preview. Documentation, a cookbook, and integration guidance with Azure AI Agent Service are available for developers.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleReal TikTokers are pretending to be Veo 3 AI creations for fun, attention
Next Article Scott Aaronson: Quantum Computing | Lex Fridman Podcast #72
Advanced AI Editor
  • Website

Related Posts

Dig deeper in Google Search with AI Overview and three buttons

July 20, 2025

Not Google or Bing! This search engine lets you block AI images in search results

July 19, 2025

ChatGPT CEO reveals shocking cost of every AI search you make

July 18, 2025
Leave A Reply

Latest Posts

Sam Gilliam Foundation, David Kordansky Sued Over ‘Disavowed’ Painting

Donors Reportedly Pulling Support from Florida University Museum after its Controversial Transfer

What will come of the Guggenheim Asher legal battle?

Painter Says DHS Stole His Work for Post About ‘Homeland’s Heritage’

Latest Posts

Perplexity Plans to Bring Comet AI Browser to Smartphones

July 21, 2025

Hey you, AI algorithm! Explain yourself!

July 21, 2025

IBM launches global entrance test for MBA, MCA, MSc admissions | Bengaluru News

July 21, 2025

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Perplexity Plans to Bring Comet AI Browser to Smartphones
  • Hey you, AI algorithm! Explain yourself!
  • IBM launches global entrance test for MBA, MCA, MSc admissions | Bengaluru News
  • Nvidia Warns of Limited H20 AI Chip Supply Amid China Trade Uncertainty
  • Paper page – The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs

Recent Comments

  1. avenue17 on Local gov’t reps say they look forward to working with Thomas
  2. Lucky Star on Former Tesla AI czar Andrej Karpathy coins ‘vibe coding’: Here’s what it means
  3. микрокредит on Former Tesla AI czar Andrej Karpathy coins ‘vibe coding’: Here’s what it means
  4. www.binance.com注册 on MGX, Bpifrance, Nvidia, and Mistral AI plan 1.4GW Paris data center campus
  5. creación de cuenta en Binance on University of Tokyo to upgrade its IBM quantum computer with 156-qubit Heron QPU

Welcome to Advanced AI News—your ultimate destination for the latest advancements, insights, and breakthroughs in artificial intelligence.

At Advanced AI News, we are passionate about keeping you informed on the cutting edge of AI technology, from groundbreaking research to emerging startups, expert insights, and real-world applications. Our mission is to deliver high-quality, up-to-date, and insightful content that empowers AI enthusiasts, professionals, and businesses to stay ahead in this fast-evolving field.

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

LinkedIn Instagram YouTube Threads X (Twitter)
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 advancedainews. Designed by advancedainews.

Type above and press Enter to search. Press Esc to cancel.