arXiv 2503.08679

Chain-of-Thought Reasoning In The Wild Is Not Always Faithful

By Iván Arcuschin, Jett Janiak, et al.

Published 2025-03-11

Chain-of-Thought (CoT) reasoning has significantly advanced state-of-the-art AI capabilities. However, recent studies have shown that CoT reasoning is not always faithful when models face an explicit bias in their prompts, i.e., the CoT can give an incorrect picture of how models arrive at conclusions. We go further and show that unfaithful CoT can also occur on realistic prompts with no artificial bias. We find tha…
