Topic

#formulacode

One article exploring FormulaCode. Expert insights and analysis from our editorial team.

Articles

Agents & Frameworks

FormulaCode's 957-Task Benchmark Catches Frontier Agents Failing at Real-Codebase Performance Optimization

FormulaCode finds that frontier agents trail human experts at repo-scale optimization, exposing SWE-Bench's blind spot: patches can pass its tests without any verified real-world speedup.