ReasonBridge: Efficient Reasoning Transfer from Closed to Open-Source Language Models

arXiv:2506.22865v1 Announce Type: new
Abstract: Recent advancements in Large Language Models (LLMs) have revealed a significant performance gap between closed-source and open-source models, particularly in tasks requiring complex reasoning and precise instruction following. This paper introduces ReasonBridge, a methodology that efficiently transfers reasoning capabilities from powerful closed-source models to open-source models through a novel hierarchical knowledge distillation framework. We develop a tailored dataset, Reason1K, of only 1,000 carefully curated reasoning traces emphasizing difficulty, diversity, and quality. These traces are filtered from multiple domains using a structured multi-criteria selection algorithm. Our transfer learning approach incorporates: (1) a hierarchical distillation process capturing both strategic abstraction and tactical implementation patterns, (2) a sparse reasoning-focused adapter architecture requiring only 0.3% additional trainable parameters, and (3) a test-time compute scaling mechanism using guided inference interventions. Comprehensive evaluations demonstrate that ReasonBridge improves reasoning capabilities in open-source models by up to 23% on benchmark tasks, significantly narrowing the gap with closed-source models. Notably, the enhanced Qwen2.5-14B outperforms Claude-Sonnet3.5 on MATH500 and matches its performance on competition-level AIME problems. Our methodology generalizes effectively across diverse reasoning domains and model architectures, establishing a sample-efficient approach to reasoning enhancement for instruction following.
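The abstract says that Reason1K is filtered by difficulty, diversity, and quality but does not give the algorithm. The sketch below is only one plausible way to combine such criteria, not the paper's method: the `Trace` fields, the score ranges, the `min_quality` threshold, and the `per_domain_cap` diversity rule are all hypothetical names and choices introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Trace:
    """A candidate reasoning trace (all fields are illustrative assumptions)."""
    trace_id: str
    domain: str
    difficulty: float  # hypothetical score in [0, 1], higher = harder
    quality: float     # hypothetical score in [0, 1], higher = cleaner


def select_traces(traces, budget, min_quality=0.5, per_domain_cap=None):
    """Greedy multi-criteria selection (assumed scheme, not from the paper).

    1. Quality: drop traces scoring below min_quality.
    2. Difficulty: consider the hardest remaining traces first.
    3. Diversity: optionally cap how many traces one domain may contribute.
    """
    kept = [t for t in traces if t.quality >= min_quality]
    # Hardest first; ties broken by quality, then id, for determinism.
    kept.sort(key=lambda t: (-t.difficulty, -t.quality, t.trace_id))

    selected = []
    per_domain = {}
    for t in kept:
        if len(selected) >= budget:
            break
        count = per_domain.get(t.domain, 0)
        if per_domain_cap is not None and count >= per_domain_cap:
            continue  # this domain is already fully represented
        selected.append(t)
        per_domain[t.domain] = count + 1
    return selected


# Toy pool with made-up scores; a real budget would be 1,000 traces.
pool = [
    Trace("a", "algebra", 0.9, 0.8),
    Trace("b", "algebra", 0.8, 0.9),
    Trace("c", "geometry", 0.7, 0.9),
    Trace("d", "physics", 0.95, 0.3),
    Trace("e", "physics", 0.6, 0.7),
]
chosen = select_traces(pool, budget=3, per_domain_cap=1)
print([t.trace_id for t in chosen])
```

In this toy pool, trace `d` is the hardest but fails the quality filter, and `b` is skipped because the algebra domain has already reached its cap, so the three slots are spread across three domains.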
