Topic

#code-quality

1 article exploring code-quality. Expert insights and analysis from our editorial team.

Showing 1–1 of 1 articles

Articles

Newest first

SWE-Bench's Dirty Secret: AI-Passing PRs That Real Engineers Would Reject

New research from METR shows roughly half of SWE-bench-passing AI-generated PRs would be rejected by actual project maintainers—exposing a 24-percentage-point gap between benchmark scores and real-world code acceptability.

March 13, 2026 · 9 min read

Browse All Topics