Paper Page - ReasonIR: Training Retrievers For Reasoning Tasks

We present ReasonIR-8B, the first retriever specifically trained for general
reasoning tasks. Existing retrievers have shown limited gains on reasoning
tasks, in part because existing training datasets focus on short factual
queries tied to documents that straightforwardly answer them. We develop a
synthetic data generation pipeline that, for each document, our pipeline
creates a challenging and relevant query, along with a plausibly related but
ultimately unhelpful hard negative. By training on a mixture of our synthetic
data and existing public data, ReasonIR-8B achieves a new state-of-the-art of
29.9 nDCG@10 without reranker and 36.9 nDCG@10 with reranker on BRIGHT, a
widely-used reasoning-intensive information retrieval (IR) benchmark. When
applied to RAG tasks, ReasonIR-8B improves MMLU and GPQA performance by 6.4%
and 22.6% respectively, relative to the closed-book baseline, outperforming
other retrievers and search engines. In addition, ReasonIR-8B uses test-time
compute more effectively: on BRIGHT, its performance consistently increases
with longer and more information-rich rewritten queries; it continues to
outperform other retrievers when combined with an LLM reranker. Our training
recipe is general and can be easily extended to future LLMs; to this end, we
open-source our code, data, and model.

Source link

What's Hot

IBM bags Vi project to launch AI Innovation Hub, modernise ops

DeepSeek-R1: Hype cools as India seeks practical GenAI solutions

Google Docs gets AI voice reader, lets you turn your documents into audio with a click

Paper page – ReasonIR: Training Retrievers for Reasoning Tasks

Next Visual Granularity Generation – Takara TLDR

ComoRAG: A Cognitive-Inspired Memory-Organized RAG for Stateful Long Narrative Reasoning – Takara TLDR

4DNeX: Feed-Forward 4D Generative Modeling Made Easy – Takara TLDR

Barbara Hepworth Sculpture Will Remain in UK After £3.8 M. Raised

After 12-Year Hiatus, Egypt’s Alexandria Biennale Will Return

Ai Weiwei Visits Ukraine’s Front Line Ahead of Kyiv Installation

Maren Hassinger to Receive Her Largest Retrospective to Date Next Year

IBM bags Vi project to launch AI Innovation Hub, modernise ops

DeepSeek-R1: Hype cools as India seeks practical GenAI solutions

Google Docs gets AI voice reader, lets you turn your documents into audio with a click

What's Hot

Paper page – ReasonIR: Training Retrievers for Reasoning Tasks

Related Posts

Subscribe to Updates