Reranking and preference tuning improve the quality of perspective summaries generated by LLMs, as measured by language model-based metrics that outperform traditional ones.
Generating unbiased summaries in real-world settings such as political
perspective summarization remains a crucial application of Large Language
Models (LLMs). Yet, existing evaluation frameworks rely on traditional metrics
for measuring key attributes such as coverage and faithfulness without
verifying their applicability, and efforts to develop improved summarizers are
still nascent. We address these gaps by (1) identifying reliable metrics for
measuring perspective summary quality, and (2) investigating the efficacy of
LLM-based methods beyond zero-shot inference. Specifically, we build a test
set for benchmarking metric reliability using human annotations and show that
traditional metrics underperform relative to language model-based metrics,
which prove to be strong evaluators. Using these metrics, we show that
reranking-based methods yield strong results, and preference tuning with
synthetically generated and reranking-labeled data further boosts performance.
With these findings, we aim to support the reliable evaluation and
development of perspective summarization methods.
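
To make the recipe named above concrete, the following is a minimal sketch, not the authors' implementation, of (1) reranking candidate perspective summaries with a language model-based metric and (2) converting the ranking into preference pairs for DPO-style tuning. The function `score_with_llm_judge`, the helper names, and the sample texts are illustrative assumptions standing in for a real LLM-based evaluator and synthetically generated candidates.

```python
# Sketch of reranking + reranking-labeled preference data (assumed workflow,
# not the paper's code).

from typing import Callable, List, Tuple


def score_with_llm_judge(source: str, summary: str) -> float:
    """Hypothetical LLM-based metric: higher = better coverage/faithfulness."""
    # Placeholder heuristic so the sketch runs; a real judge model would go here.
    source_terms = set(source.lower().split())
    summary_terms = set(summary.lower().split())
    return len(source_terms & summary_terms) / max(len(source_terms), 1)


def rerank(source: str, candidates: List[str],
           metric: Callable[[str, str], float]) -> List[Tuple[str, float]]:
    """Score each candidate summary with the metric and sort best-first."""
    scored = [(cand, metric(source, cand)) for cand in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


def preference_pairs(ranked: List[Tuple[str, float]]) -> List[dict]:
    """Turn reranking labels into preference records: top candidate is
    'chosen', lower-ranked candidates are 'rejected'."""
    best, _ = ranked[0]
    return [{"chosen": best, "rejected": cand} for cand, _ in ranked[1:]]


if __name__ == "__main__":
    source = ("Speakers disagree on the carbon tax: some cite climate "
              "benefits, others cite costs to households.")
    candidates = [  # e.g., sampled from an LLM at varied temperatures
        "Supporters stress climate benefits while opponents stress the "
        "costs of the carbon tax to households.",
        "The carbon tax is discussed.",
        "Everyone agrees the carbon tax is good.",
    ]
    ranked = rerank(source, candidates, score_with_llm_judge)
    pairs = preference_pairs(ranked)
    print(ranked[0][0])   # summary selected by reranking
    print(len(pairs))     # preference records available for tuning
```

Under these assumptions, the reranked pick can be returned directly at inference time, while the `chosen`/`rejected` records feed a preference-tuning stage so the summarizer itself improves beyond zero-shot behavior.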