arXiv 2503.08679
Chain-of-Thought Reasoning In The Wild Is Not Always Faithful
By Iván Arcuschin, Jett Janiak, et al.
Published 2025-03-11
Discussion
Read the public discussion and references gathered around this paper.
Chain-of-Thought (CoT) reasoning has significantly advanced state-of-the-art AI capabilities. However, recent studies have shown that CoT reasoning is not always faithful when models face an explicit bias in their prompts, i.e., the CoT can give an incorrect picture of how models arrive at conclusions. We go further and show that unfaithful CoT can also occur on realistic prompts with no artificial bias. We find tha…