arXiv 2412.18547

Token-Budget-Aware LLM Reasoning

By Tingxu Han, Zhenting Wang, et al.

Published 2024-12-24


Reasoning is critical for large language models (LLMs) to excel in a wide range of tasks. While methods like Chain-of-Thought (CoT) reasoning enhance LLM performance by decomposing problems into intermediate steps, they also incur significant overhead in token usage, leading to increased costs. We find that the reasoning process of current LLMs is unnecessarily lengthy and can be compressed by including a rea…
