Real-time Spoken Language Models (SLMs) struggle to leverage Chain-of-Thought
(CoT) reasoning due to the prohibitive latency of generating the entire thought
process sequentially. Enabling SLMs to think while speaking, as humans do, is
attracting increasing attention. We present Mind-Paced Speaking (MPS), the
first brain-inspired framework to enable high-fidelity, real-time reasoning.
Mirroring how humans recruit distinct brain regions for
thinking and responding, we propose a novel dual-brain approach, employing a
“Formulation Brain” for high-level reasoning to pace and guide a separate
“Articulation Brain” for fluent speech generation. This division of labor
eliminates mode-switching, preserving the integrity of the reasoning process.
Experiments show that MPS significantly outperforms existing
think-while-speaking methods and achieves reasoning performance comparable to
models that pre-compute the full CoT before speaking, while drastically
reducing latency. Under a zero-latency configuration, MPS achieves 92.8%
accuracy on the mathematical reasoning benchmark Spoken-MQA and a score of
82.5 on the speech conversation benchmark URO-Bench. Our work bridges the gap
between high-quality reasoning and real-time interaction.
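
To make the pacing mechanism concrete, the sketch below shows one way a dual-brain loop could be organized. It is a minimal sketch under stated assumptions, not the paper's implementation: the class names (FormulationBrain, ArticulationBrain), the function mind_paced_speaking, and the fixed three-step reasoning stub are all illustrative. The idea it captures is that the formulation side advances the chain of thought one step at a time, and each spoken chunk is conditioned only on the reasoning available so far, so speaking begins without waiting for the full CoT.

```python
# Minimal, hypothetical sketch of a dual-brain think-while-speaking loop.
# FormulationBrain and ArticulationBrain stand in for two separate models;
# the names and stubbed logic are assumptions, not the paper's code.

from dataclasses import dataclass, field


@dataclass
class FormulationBrain:
    """Reasoning model: extends the chain of thought one step at a time."""
    thoughts: list[str] = field(default_factory=list)

    def step(self, question: str) -> str | None:
        # Stub: a real model would decode the next reasoning tokens here.
        if len(self.thoughts) >= 3:  # pretend reasoning finishes in 3 steps
            return None
        thought = f"step {len(self.thoughts) + 1}: analyze {question!r}"
        self.thoughts.append(thought)
        return thought


@dataclass
class ArticulationBrain:
    """Speech model: verbalizes a chunk, conditioned on the thoughts so far."""

    def speak_chunk(self, thoughts: list[str]) -> str:
        # Stub: a real model would emit speech tokens grounded in `thoughts`.
        return f"<speech chunk grounded in {len(thoughts)} thought step(s)>"


def mind_paced_speaking(question: str) -> list[str]:
    """Interleave the two brains: speech is paced by reasoning progress,
    so the first chunk is spoken after the first thought, not the full CoT."""
    formulator = FormulationBrain()
    articulator = ArticulationBrain()
    spoken = []
    while formulator.step(question) is not None:
        spoken.append(articulator.speak_chunk(formulator.thoughts))
    return spoken


if __name__ == "__main__":
    for chunk in mind_paced_speaking("What is 12 * 7?"):
        print(chunk)
```

The design choice this sketch is meant to illustrate is that pacing replaces mode-switching: the articulation side never interrupts or truncates the reasoning process, it only consumes it incrementally.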