Paper page - Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent

Retrieval-augmented generation (RAG) is a common strategy to reduce hallucinations in Large Language Models (LLMs). While reinforcement learning (RL) can enable LLMs to act as search agents by activating retrieval capabilities, existing ones often underutilize their internal knowledge. This can lead to redundant retrievals, potential harmful knowledge conflicts, and increased inference latency. To address these limitations, an efficient and adaptive search agent capable of discerning optimal retrieval timing and synergistically integrating parametric (internal) and retrieved (external) knowledge is in urgent need. This paper introduces the Reinforced Internal-External Knowledge Synergistic Reasoning Agent (IKEA), which could indentify its own knowledge boundary and prioritize the utilization of internal knowledge, resorting to external search only when internal knowledge is deemed insufficient. This is achieved using a novel knowledge-boundary aware reward function and a knowledge-boundary aware training dataset. These are designed for internal-external knowledge synergy oriented RL, incentivizing the model to deliver accurate answers, minimize unnecessary retrievals, and encourage appropriate external searches when its own knowledge is lacking. Evaluations across multiple knowledge reasoning tasks demonstrate that IKEA significantly outperforms baseline methods, reduces retrieval frequency significantly, and exhibits robust generalization capabilities.

Source link

What's Hot

A new paradigm for AI: How ‘thinking as optimization’ leads to better general-purpose models

OpenAI delays the release of its open model, again

TU Wien Rendering #10 – Camera models

Paper page – Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent

Paper page – PyVision: Agentic Vision with Dynamic Tooling

Paper page – Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

Paper page – OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

Homeland Security Targets Chicago’s National Museum of Puerto Rican Arts & Culture

1,600-Year-Old Tomb of Mayan City’s Founding King Discovered in Belize

Centre Pompidou Cancels Caribbean Art Show, Raising Controversy

‘Night at the Museum’ Reboot in the Works

A new paradigm for AI: How ‘thinking as optimization’ leads to better general-purpose models

OpenAI delays the release of its open model, again

TU Wien Rendering #10 – Camera models

What's Hot

Paper page – Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent

Related Posts

Subscribe to Updates