Paper page - Chain-of-Thought Tokens are Computer Program Variables

Chain-of-thoughts (CoT) requires large language models (LLMs) to generate
intermediate steps before reaching the final answer, and has been proven
effective to help LLMs solve complex reasoning tasks. However, the inner
mechanism of CoT still remains largely unclear. In this paper, we empirically
study the role of CoT tokens in LLMs on two compositional tasks: multi-digit
multiplication and dynamic programming. While CoT is essential for solving
these problems, we find that preserving only tokens that store intermediate
results would achieve comparable performance. Furthermore, we observe that
storing intermediate results in an alternative latent form will not affect
model performance. We also randomly intervene some values in CoT, and notice
that subsequent CoT tokens and the final answer would change correspondingly.
These findings suggest that CoT tokens may function like variables in computer
programs but with potential drawbacks like unintended shortcuts and
computational complexity limits between tokens. The code and data are available
at https://github.com/solitaryzero/CoTs_are_Variables.

Source link

What's Hot

Mira Murati’s Thinking Machines Lab is worth $12B in seed round

NVIDIA’s New AI: Insanely Good!

IBM vs. Amazon: Which Cloud Infrastructure Stock Offers More Upside? – July 15, 2025

Paper page – Chain-of-Thought Tokens are Computer Program Variables

Paper page – EmbRACE-3K: Embodied Reasoning and Action in Complex Environments

Paper page – LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers

Paper page – Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation

The Artists and Art Pros Who Donated to Cuomo and Mamdani’s Campaigns

Phillips Sues Billionaire’s Son Over $14.5 M. Pollock Painting

Murujuga Rock Art in Australia Receives UNESCO World Heritage Status

‘Earth Room’ Caretaker Dies at 70

Mira Murati’s Thinking Machines Lab is worth $12B in seed round

NVIDIA’s New AI: Insanely Good!

IBM vs. Amazon: Which Cloud Infrastructure Stock Offers More Upside? – July 15, 2025

What's Hot

Paper page – Chain-of-Thought Tokens are Computer Program Variables

Related Posts

Subscribe to Updates