Large language models (LLMs) often generate hallucinations — unsupported
content that undermines reliability. While most prior work frames hallucination
detection as a binary task, many real-world applications require identifying
hallucinated spans, a task that involves multi-step decision making. This
naturally raises the question of whether explicit reasoning can help with the
complex task of detecting hallucination spans. To answer this question, we
first evaluate pretrained models with and without Chain-of-Thought (CoT)
reasoning, and show that, when sampled multiple times, CoT reasoning can
produce at least one correct answer. Motivated by this, we propose
RL4HS, a reinforcement learning framework that incentivizes reasoning with a
span-level reward function. RL4HS builds on Group Relative Policy Optimization
and introduces Class-Aware Policy Optimization to mitigate the reward-imbalance
issue. Experiments on the RAGTruth benchmark (summarization, question
answering, data-to-text) show that RL4HS surpasses pretrained reasoning models
and supervised fine-tuning, demonstrating the necessity of reinforcement
learning with span-level rewards for detecting hallucination spans.
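
The exact form of the span-level reward is not specified here; a minimal sketch,
assuming an exact-match span-level F1 between predicted and gold hallucination
spans (the function name and span representation below are illustrative, not the
paper's actual implementation), could look like this:

```python
# Illustrative sketch only: an exact-match span-level F1 reward for
# hallucination span detection. The reward actually used in RL4HS may differ.
from typing import List, Tuple

Span = Tuple[int, int]  # (start, end) character offsets into the response

def span_f1_reward(predicted: List[Span], gold: List[Span]) -> float:
    """Reward a completed prediction with the F1 between predicted and gold spans."""
    if not predicted and not gold:
        return 1.0  # correctly reporting "no hallucination" earns full reward
    if not predicted or not gold:
        return 0.0
    matched = len(set(predicted) & set(gold))
    precision = matched / len(predicted)
    recall = matched / len(gold)
    if precision + recall == 0.0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Example: one of two predicted spans matches the single gold span -> F1 of about 0.67.
print(span_f1_reward([(10, 25), (40, 52)], [(10, 25)]))
```

One design consideration: such a reward is only defined over a complete prediction,
which is one reason span-level supervision fits naturally into a reinforcement-learning
objective rather than a per-token supervised loss.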