Paper page - ECoRAG: Evidentiality-guided Compression for Long Context RAG

ECoRAG framework enhances LLM performance in ODQA by compressing retrieved documents based on evidentiality, reducing latency and token usage.

Large Language Models (LLMs) have shown remarkable performance in Open-Domain
Question Answering (ODQA) by leveraging external documents through
Retrieval-Augmented Generation (RAG). To reduce RAG overhead, from longer
context, context compression is necessary. However, prior compression methods
do not focus on filtering out non-evidential information, which limit the
performance in LLM-based RAG. We thus propose Evidentiality-guided RAG, or
ECoRAG framework. ECoRAG improves LLM performance by compressing retrieved
documents based on evidentiality, ensuring whether answer generation is
supported by the correct evidence. As an additional step, ECoRAG reflects
whether the compressed content provides sufficient evidence, and if not,
retrieves more until sufficient. Experiments show that ECoRAG improves LLM
performance on ODQA tasks, outperforming existing compression methods.
Furthermore, ECoRAG is highly cost-efficient, as it not only reduces latency
but also minimizes token usage by retaining only the necessary information to
generate the correct answer. Code is available at
https://github.com/ldilab/ECoRAG.

Source link

What's Hot

CPPIB loans $225-million for expansion of Ontario AI computing data centre

AI agents unifying structured and unstructured data: Transforming support analytics and beyond with Amazon Q Plugins

NTT DATA, Mistral AI to Jointly Deploy Safe and Private Enterprise-grade AI Solutions

Paper page – ECoRAG: Evidentiality-guided Compression for Long Context RAG

Paper page – ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

Paper page – Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

Paper page – Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation

Theatre Director and Artist Dies at 83

France to Accelerate Return of Looted Artworks—and More Art News

Person Dies After Jumping from Whitney Museum

At Aspen Art Week, Bigger Fairs Make for a High-Altitude Market Bet

CPPIB loans $225-million for expansion of Ontario AI computing data centre

AI agents unifying structured and unstructured data: Transforming support analytics and beyond with Amazon Q Plugins

NTT DATA, Mistral AI to Jointly Deploy Safe and Private Enterprise-grade AI Solutions

What's Hot

Paper page – ECoRAG: Evidentiality-guided Compression for Long Context RAG

Related Posts

Subscribe to Updates