Topic

#code-quality

1 article exploring code-quality. Expert insights and analysis from our editorial team.

Showing 1–1 of 1 articles

Articles

Newest first
AI Engineering

SWE-Bench's Dirty Secret: AI-Passing PRs That Real Engineers Would Reject

New research from METR shows roughly half of SWE-bench-passing AI-generated PRs would be rejected by actual project maintainers—exposing a 24-percentage-point gap between benchmark scores and real-world code acceptability.

· 9 min read