Topic

#chain-of-thought

3 articles exploring chain-of-thought. Expert insights and analysis from our editorial team.

Showing 1–3 of 3 articles

Articles

Newest first
Models & Research

The Last Word Often Wins: A Format Confound Inflates Chain-of-Thought Corruption Robustness Scores

A format confound in CoT corruption benchmarks—suffix sensitivity collapsed 19× when final-answer text was stripped—means published faithfulness scores are inflated.

Agents & Frameworks

FSE 2026: Chain-of-Thought Fails Per-Bias as Debiasing; Axiomatic Cues Cut Sensitivity 51%

FSE 2026: chain-of-thought fails per-bias on PROBE-SWE SE tasks. Axiomatic cues cut bias sensitivity 51%, exposing gaps in CrewAI, LangChain, Pydantic AI defaults.

Agents & Frameworks

PROBE-SWE Finds Chain-of-Thought and Self-Debiasing Don't Reduce Prompt-Induced Bias in Coding Agents

PROBE-SWE (arXiv 2604.16756) finds chain-of-thought and self-debiasing fail to reduce prompt-induced cognitive bias in SE agents; axiomatic reasoning cues cut it 51%.