Ethics, Policy & Safety
34 articles exploring Ethics, Policy & Safety. Expert analysis and insights from our editorial team.
The policy and safety layer of AI is not abstract: it determines which systems get deployed in courtrooms, hospitals, and public infrastructure, and what accountability exists when they fail. This cluster covers AI safety research, algorithmic harm cases, regulatory divergence, and the transparency collapse at frontier labs.
The Stanford HAI 2026 AI Index puts the transparency problem in specific numbers: average FMTI transparency scores dropped from 58 to 40 in a single year, a 31% collapse. Frontier labs are simultaneously asking for public trust and disclosing less about training data, evaluation methodology, and safety testing. The gap between claimed safety assurances and auditable evidence is widening, not narrowing.
Regulatory divergence between the EU and US has moved from policy debate to compliance reality. The EU AI Act’s risk-tier framework is now being implemented—requirements for high-risk systems, banned applications, and conformity assessments—while the US moved toward lighter-touch executive guidance. The result is a genuine Balkanization problem for AI products operating across jurisdictions.
Algorithmic harm cases are no longer edge cases. Groundy has covered the wrongful arrest pattern in AI facial recognition—at least eight documented cases, nearly all affecting Black individuals, in a system where NIST studies show 10-100x higher error rates for darker-skinned faces. The issue is not only the technology; it is the institutional trust police departments extend to vendors whose accuracy claims don’t survive demographic auditing.
Constitutional AI, RLHF alignment, and safety via output filtering represent different philosophical approaches to the same problem: how do you make models that reliably refuse harmful requests while remaining useful? Groundy covers the research critically—distinguishing genuine safety gains from sophisticated filtering that degrades at the edges of distribution.
Scientific integrity is an underreported ethics story. AI-enabled paper mills are flooding academic literature with automated fraudulent research at a pace peer review cannot match; the journals most affected are the ones where AI tools also do most of the checking. When AI enables fraud and AI is supposed to detect it, the feedback loop is not self-correcting. Groundy covers the governance implications—for publishers, for researchers, and for AI training pipelines that ingest scientific literature.
Featured in this cluster
Constitutional AI: Teaching Models to Self-Correct Before They Act
Anthropic's Constitutional AI trains language models to critique and revise their own outputs using principles rather than human labels, but questions remain about whether this represents genuine safety gains or sophisticated filtering mechanisms.
CornerstoneUS vs. EU AI Regulation: Two Incompatible Visions for the AI Future
The EU enforces strict AI rules while the US deregulates — creating a compliance nightmare for global AI companies and risking permanent Balkanization of AI.
CornerstoneWrongfully Jailed by an Algorithm: AI Facial Recognition's Misidentification Crisis
At least eight innocent people—nearly all Black—have been wrongfully arrested because police trusted AI facial recognition systems that government studies show misidentify darker-skinned faces at rates 10 to 100 times higher than white faces. The crisis isn't the technology alone; it's the institutional trust placed in documented bias.
CornerstoneStanford's 2026 AI Index: Frontier Model Transparency Scores Collapsed 31% in One Year
The 2025 FMTI found average transparency scores dropped from 58 to 40 in a single year. Here's what that means for auditors and responsible deployment.
Latest in Ethics, Policy & Safety
Maryland Enacts First US Ban on Algorithmic Grocery Pricing, Effective Immediately
Governor Moore signed Maryland's Protection From Predatory Pricing Act on April 28, making it the first US state law to ban AI-based grocery pricing, effective immediately.
Connecticut SB 5 Passes May 1: AI Provenance, AEDT Disclosures, and Chatbot Guardrails by 2027
Connecticut SB 5 requires provenance for large generative platforms by October 2026, AEDT disclosures for HR tools, and companion-chatbot guardrails for minors by January.
EU Commission's May 8 Article 50 Draft Guidelines Pin AI Disclosure to an 'Average Consumer' Test
The EU Commission's May 8 draft guidelines set an 'average consumer' standard for AI disclosure exemptions under Article 50, with a multi-factor vulnerable-group test that.
Frontier AI Broke Open CTFs: What Hack The Box and BearcatCTF 2026 Results Mean for Security Hiring Signals
Frontier AI now ranks in the top 5% of CTFs, eroding leaderboards as a security hiring signal and forcing organizers toward bans, hybrid scoring, or AI-only divisions.
Frontier AI Has Broken the Open CTF Format: What the Scoreboard Collapse Means for Security Training
Frontier AI now autonomously solves medium and hard CTF challenges, collapsing open scoreboards as a measure of human skill and threatening the pipeline for security talent.
FTC's TAKE IT DOWN Act Lands May 19: 48-Hour Deepfake NCII Takedowns and No Safe Harbor
Ferguson's May 11 warning letters put 15 UGC platforms on notice as Section 3 of the TAKE IT DOWN Act activates May 19, requiring 48-hour NCII removal with no safe harbor.
Salesforce Spring '26 Reveals a Default-On AI Training Setting That Predates the Atlassian Backlash
Salesforce's Spring '26 toggle surfaced a default-on AI training posture dating to 2018, joining GitHub and Atlassian in a spring wave that shifts privacy burden to buyers.
White House Drafts FDA-Style Pre-Release Vetting for Frontier AI After Anthropic's Mythos Disclosure
The White House is studying FDA-style pre-release vetting for frontier AI after Anthropic's Mythos disclosure, but a fast walkback and internal feud have left policy in limbo.
California SB 1119 and AB 2023 Cleared Committee April 21: Companion Chatbots Owe Annual AG-Filed Audits
California companion-chatbot bills advanced in April 2026, mandating annual AG-filed audits, hard usage caps for minors, and per-child civil liability.
Citizen Lab Names Three Telcos as Persistent Entry Points for Commercial SS7 Surveillance Vendors
Citizen Lab names 019Mobile, Tango Networks, and Airtel Jersey as persistent entry points for commercial SS7 surveillance vendors, shifting accountability to named carriers.
Symbolic Guardrails for AI Agents: Hard Safety Guarantees Without Crippling Capability
A new paper shows symbolic guardrails can push agent safety to 100% in regulated domains without capability loss — but only for 74% of real-world policies.
America's AI Researcher Pipeline Dropped 89%. What the Stanford Index Means for Teams Hiring AI Engineers
Stanford's 2026 AI Index reports an 89% collapse in AI researcher inflows to the US. Here's what it means for teams actively building AI engineering capacity.
Atlassian Turned On AI Training Data Collection by Default — Here's What to Disable
Atlassian's data contribution policy sends Jira and Confluence content to AI training by default. Here's the exact settings path to opt out before August 17.
Stanford's 2026 AI Index: Frontier Model Transparency Scores Collapsed 31% in One Year
The 2025 FMTI found average transparency scores dropped from 58 to 40 in a single year. Here's what that means for auditors and responsible deployment.
The AI Grief Split: When Emotional Bonds with Language Models Break
People form real emotional bonds with AI companions. When models update or shut down, users experience genuine grief—a psychological and ethical crisis point.