In-Place Feedback: A New Paradigm For Guiding LLMs In Multi-Turn Reasoning - Takara TLDR

Large language models (LLMs) are increasingly studied in the context of
multi-turn reasoning, where models iteratively refine their outputs based on
user-provided feedback. Such settings are crucial for tasks that require
complex reasoning, yet existing feedback paradigms often rely on issuing new
messages. LLMs struggle to integrate these reliably, leading to inconsistent
improvements. In this work, we introduce in-place feedback, a novel interaction
paradigm in which users directly edit an LLM’s previous response, and the model
conditions on this modified response to generate its revision. Empirical
evaluations on diverse reasoning-intensive benchmarks reveal that in-place
feedback achieves better performance than conventional multi-turn feedback
while using $79.1\%$ fewer tokens. Complementary analyses on controlled
environments further demonstrate that in-place feedback resolves a core
limitation of multi-turn feedback: models often fail to apply feedback
precisely to erroneous parts of the response, leaving errors uncorrected and
sometimes introducing new mistakes into previously correct content. These
findings suggest that in-place feedback offers a more natural and effective
mechanism for guiding LLMs in reasoning-intensive tasks.

Source link

What's Hot

Does CEO Transition and Legal Turmoil Change the Bull Case for C3.ai (AI)?

On Predictability of Reinforcement Learning Dynamics for Large Language Models – Takara TLDR

How Safe Is Your Facial Data With OpenAI’s Sora App?

In-Place Feedback: A New Paradigm for Guiding LLMs in Multi-Turn Reasoning – Takara TLDR

On Predictability of Reinforcement Learning Dynamics for Large Language Models – Takara TLDR

Making, not Taking, the Best of N – Takara TLDR

GEM: A Gym for Agentic LLMs – Takara TLDR

Sotheby’s Sells York Avenue HQ to Weill Cornell, Prepares Breuer Move

Outsider Art Fair’s New Director Elizabeth Denny Discusses Her Role

50 Pianos Sound Off in ’11,000 Strings’ at the Park Avenue Armory

Five Arts and Culture Nonprofits Join NYC’s Cultural Institutions Group

Does CEO Transition and Legal Turmoil Change the Bull Case for C3.ai (AI)?

On Predictability of Reinforcement Learning Dynamics for Large Language Models – Takara TLDR

How Safe Is Your Facial Data With OpenAI’s Sora App?

What's Hot

In-Place Feedback: A New Paradigm for Guiding LLMs in Multi-Turn Reasoning – Takara TLDR

Related Posts

Subscribe to Updates