Paper page - Does More Inference-Time Compute Really Help Robustness?

Recently, Zaremba et al. demonstrated that increasing inference-time
computation improves robustness in large proprietary reasoning LLMs. In this
paper, we first show that smaller-scale, open-source models (e.g., DeepSeek R1,
Qwen3, Phi-reasoning) can also benefit from inference-time scaling using a
simple budget forcing strategy. More importantly, we reveal and critically
examine an implicit assumption in prior work: intermediate reasoning steps are
hidden from adversaries. By relaxing this assumption, we identify an important
security risk, intuitively motivated and empirically verified as an inverse
scaling law: if intermediate reasoning steps become explicitly accessible,
increased inference-time computation consistently reduces model robustness.
Finally, we discuss practical scenarios where models with hidden reasoning
chains are still vulnerable to attacks, such as models with tool-integrated
reasoning and advanced reasoning extraction attacks. Our findings collectively
demonstrate that the robustness benefits of inference-time scaling depend
heavily on the adversarial setting and deployment context. We urge
practitioners to carefully weigh these subtle trade-offs before applying
inference-time scaling in security-sensitive, real-world applications.

Source link

What's Hot

Alibaba unleashes Qwen3 coding model for developers to push AI agent adoption

Google DeepMind’s new AI model helps researchers understand ancient text.

OpenAI’s ChatGPT Agent Is Haunting My Browser

Paper page – Does More Inference-Time Compute Really Help Robustness?

Paper page – HOComp: Interaction-Aware Human-Object Composition

Paper page – RefCritic: Training Long Chain-of-Thought Critic Models with Refinement Feedback

Paper page – Steering Out-of-Distribution Generalization with Concept Ablation Fine-Tuning

Winston Artory Merger Targets $15B Art Valuation Market

Barnes Foundation Online Learning Platform Expands to Penn Museum

Archaeologists Identify 5,500-Year-Old Megalithic Tombs in Poland

Phillips to Debut ‘First-of-its Kind’ Priority Bidding Structure

Alibaba unleashes Qwen3 coding model for developers to push AI agent adoption

Google DeepMind’s new AI model helps researchers understand ancient text.

OpenAI’s ChatGPT Agent Is Haunting My Browser

What's Hot

Paper page – Does More Inference-Time Compute Really Help Robustness?

Related Posts

Subscribe to Updates