arXiv 2506.23840

Do Thinking Tokens Help or Trap? Towards More Efficient Large Reasoning Model

By Bowen Ding, Yuhan Chen, et al.

Published 2025-06-30

Large Reasoning Models (LRMs) excel at solving complex problems but face an overthinking dilemma. When handling simple tasks, they often produce verbose responses overloaded with thinking tokens (e.g., "wait", "however"). These tokens trigger unnecessary high-level reasoning behaviors like reflection and backtracking, reducing efficiency. In this work, our pilot study reveals that these thinking-token-induced behaviors…
