Topic
#chain-of-thought
3 articles exploring chain-of-thought. Expert insights and analysis from our editorial team.
Showing 1–3 of 3 articles
Articles
Newest first
Models & Research
The Last Word Often Wins: A Format Confound Inflates Chain-of-Thought Corruption Robustness Scores
A format confound in CoT corruption benchmarks—suffix sensitivity collapsed 19× when final-answer text was stripped—means published faithfulness scores are inflated.
Agents & Frameworks
FSE 2026: Chain-of-Thought Fails Per-Bias as Debiasing; Axiomatic Cues Cut Sensitivity 51%
FSE 2026: chain-of-thought fails per-bias on PROBE-SWE SE tasks. Axiomatic cues cut bias sensitivity 51%, exposing gaps in CrewAI, LangChain, Pydantic AI defaults.
Agents & Frameworks
PROBE-SWE Finds Chain-of-Thought and Self-Debiasing Don't Reduce Prompt-Induced Bias in Coding Agents
PROBE-SWE (arXiv 2604.16756) finds chain-of-thought and self-debiasing fail to reduce prompt-induced cognitive bias in SE agents; axiomatic reasoning cues cut it 51%.