Topic

#ai-coding-agents

1 article exploring ai-coding-agents. Expert insights and analysis from our editorial team.

Showing 1–1 of 1 articles

Articles

Newest first

FormulaCode's 957-Task Benchmark Catches Frontier Agents Failing at Real-Codebase Performance Optimization

FormulaCode finds frontier agents trail human experts at repo-scale optimization, exposing SWE-Bench's blind spot: passing patches that never verify real-world speedups.

May 17, 2026

Browse All Topics