Why Models Know But Don’t Say: Chain-of-Thought Faithfulness Divergence Between Thinking Tokens and Answers in Open-Weight Reasoning Models - 2603.26410v1.pdf arxiv.org/pdf/2603….
Taiju Muto
@tai2
Why Models Know But Don’t Say: Chain-of-Thought Faithfulness Divergence Between Thinking Tokens and Answers in Open-Weight Reasoning Models - 2603.26410v1.pdf arxiv.org/pdf/2603….