Learning To Reason For Hallucination Span Detection - Takara TLDR

Large language models (LLMs) often generate hallucinations — unsupported
content that undermines reliability. While most prior works frame hallucination
detection as a binary task, many real-world applications require identifying
hallucinated spans, which is a multi-step decision making process. This
naturally raises the question of whether explicit reasoning can help the
complex task of detecting hallucination spans. To answer this question, we
first evaluate pretrained models with and without Chain-of-Thought (CoT)
reasoning, and show that CoT reasoning has the potential to generate at least
one correct answer when sampled multiple times. Motivated by this, we propose
RL4HS, a reinforcement learning framework that incentivizes reasoning with a
span-level reward function. RL4HS builds on Group Relative Policy Optimization
and introduces Class-Aware Policy Optimization to mitigate reward imbalance
issue. Experiments on the RAGTruth benchmark (summarization, question
answering, data-to-text) show that RL4HS surpasses pretrained reasoning models
and supervised fine-tuning, demonstrating the necessity of reinforcement
learning with span-level rewards for detecting hallucination spans.

Source link

What's Hot

Rethinking the shape convention of an MLP – Takara TLDR

We Tested the Best Free AI Image Editors—Here’s What You’ll Love and Hate

OpenAI appears to be walking back its Sora copyright policy

Learning to Reason for Hallucination Span Detection – Takara TLDR

Rethinking the shape convention of an MLP – Takara TLDR

Sparse Query Attention (SQA): A Computationally Efficient Attention Mechanism with Query Heads Reduction – Takara TLDR

A Rigorous Benchmark with Multidimensional Evaluation for Deep Research Agents: From Answers to Reports – Takara TLDR

Record Exec and Art Collector Gets Over 4 Years

Chicago’s Art Scene Offers a Beacon of Hope for Artists and Dealers

Pace to Close Hong Kong Gallery at H Queen’s This Month

Taylor Swift’s ‘Fate of Ophelia’ Has a Lot in Common with This Artwork

Rethinking the shape convention of an MLP – Takara TLDR

We Tested the Best Free AI Image Editors—Here’s What You’ll Love and Hate

OpenAI appears to be walking back its Sora copyright policy

What's Hot

Learning to Reason for Hallucination Span Detection – Takara TLDR

Related Posts

Subscribe to Updates