Paper Page - WebThinker: Empowering Large Reasoning Models With Deep Research Capability

Introduction

We propose WebThinker, a deep research agent that empowers large reasoning models (LRMs) to autonomously search the web, navigate web pages, and draft research reports during the reasoning process. WebThinker integrates a Deep Web Explorer module, which lets LRMs dynamically search, navigate, and extract information from the web when they encounter knowledge gaps. It also employs an Autonomous Think-Search-and-Draft strategy, which allows the model to interleave reasoning, information gathering, and report writing in real time. To further improve the use of research tools, we introduce a reinforcement learning (RL)-based training strategy via iterative online Direct Preference Optimization (DPO). Extensive experiments on complex reasoning benchmarks (GPQA, GAIA, WebWalkerQA, HLE) and scientific report generation tasks (Glaive) demonstrate that WebThinker significantly outperforms existing methods and strong proprietary systems. Our approach enhances LRM reliability and applicability in complex scenarios, paving the way for more capable and versatile deep research systems.
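To make the DPO objective mentioned above concrete, here is a minimal sketch of the standard per-pair DPO loss in plain Python. It is not WebThinker's training code: the function names, the `beta` default, and the idea of passing summed sequence log-probabilities as plain floats are assumptions made for illustration. In an iterative online setup, preference pairs would typically be re-sampled from the current policy in each round before a loss of this form is applied.

```python
import math


def log_sigmoid(x: float) -> float:
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def dpo_loss(
    policy_chosen_logp: float,
    policy_rejected_logp: float,
    ref_chosen_logp: float,
    ref_rejected_logp: float,
    beta: float = 0.1,  # illustrative default, not taken from the paper
) -> float:
    """Standard DPO loss for one (chosen, rejected) pair.

    Each argument is the summed log-probability of a full response
    under the trainable policy or the frozen reference model.
    """
    # How much more the policy favors each response than the reference does.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp
    # The loss shrinks as the policy prefers the chosen response more strongly.
    return -log_sigmoid(beta * (chosen_margin - rejected_margin))


if __name__ == "__main__":
    # Policy identical to the reference: the loss equals log(2).
    print(dpo_loss(-5.0, -5.0, -5.0, -5.0))
    # Policy already prefers the chosen response: the loss is lower.
    print(dpo_loss(-4.0, -6.0, -5.0, -5.0))
```

When the policy and reference agree, the loss is exactly log 2; it falls below that value as the policy raises the chosen response's likelihood relative to the rejected one.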


Our GitHub repo: https://github.com/RUC-NLPIR/WebThinker?tab=readme-ov-file

[Figures: Demo; Main Result Overview; Our WebThinker Framework]
