Generalized Parallel Scaling With Interdependent Generations - Takara TLDR

Parallel LLM inference scaling involves sampling a set of $N>1$ responses for
a single input prompt. However, these $N$ parallel responses tend to be
generated independently from each other, partitioning compute resources and
leaving potentially useful information in one generation untapped by others.
This is in contrast to response length scaling where past computation is used
in all future steps. For higher quality responses and response sets, we propose
Bridge to generate interdependent responses in parallel by rethinking batched
LLM hidden states as holistic tensors rather than independent slices. With only
a small amount (2.8%-5.1%) of new parameters, Bridge improves the relative mean
accuracy gains from reinforcement learning with verifiable rewards by up to 50%
and boosts consistency of correct responses. Trained once, Bridge scales to any
generation width, all with greater performance than independent generations,
unlocking a more general mode of parallel scaling that effectively leverages
information between sequences, compatible with any post-generation aggregation
technique.

Source link

What's Hot

Rethinking Thinking Tokens: LLMs as Improvement Operators – Takara TLDR

OpenAI and Jony Ive may be struggling to figure out their AI device

Generalized Parallel Scaling with Interdependent Generations – Takara TLDR

Generalized Parallel Scaling with Interdependent Generations – Takara TLDR

Rethinking Thinking Tokens: LLMs as Improvement Operators – Takara TLDR

TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments – Takara TLDR

Agentic Jigsaw Interaction Learning for Enhancing Visual Perception and Reasoning in Vision-Language Models – Takara TLDR

Former ARTnews Publisher Dies at 97

National Gallery of Art Closes as a Result of Government Shutdown

Almine Rech Closes London Gallery After More Than a Decade

Record Exec and Art Collector Gets Over 4 Years

Rethinking Thinking Tokens: LLMs as Improvement Operators – Takara TLDR

OpenAI and Jony Ive may be struggling to figure out their AI device