Category

Ethics, Policy & Safety

34 articles exploring Ethics, Policy & Safety. Expert analysis and insights from our editorial team.

Showing 1–15 of 34 articles · Page 1 of 3

The policy and safety layer of AI is not abstract: it determines which systems get deployed in courtrooms, hospitals, and public infrastructure, and what accountability exists when they fail. This cluster covers AI safety research, algorithmic harm cases, regulatory divergence, and the transparency collapse at frontier labs.

The Stanford HAI 2026 AI Index puts the transparency problem in specific numbers: average FMTI transparency scores dropped from 58 to 40 in a single year, a 31% collapse. Frontier labs are simultaneously asking for public trust and disclosing less about training data, evaluation methodology, and safety testing. The gap between claimed safety assurances and auditable evidence is widening, not narrowing.

Regulatory divergence between the EU and US has moved from policy debate to compliance reality. The EU AI Act’s risk-tier framework is now being implemented—requirements for high-risk systems, banned applications, and conformity assessments—while the US moved toward lighter-touch executive guidance. The result is a genuine Balkanization problem for AI products operating across jurisdictions.

Algorithmic harm cases are no longer edge cases. Groundy has covered the wrongful arrest pattern in AI facial recognition—at least eight documented cases, nearly all affecting Black individuals, in a system where NIST studies show 10-100x higher error rates for darker-skinned faces. The issue is not only the technology; it is the institutional trust police departments extend to vendors whose accuracy claims don’t survive demographic auditing.

Constitutional AI, RLHF alignment, and safety via output filtering represent different philosophical approaches to the same problem: how do you make models that reliably refuse harmful requests while remaining useful? Groundy covers the research critically—distinguishing genuine safety gains from sophisticated filtering that degrades at the edges of distribution.

Scientific integrity is an underreported ethics story. AI-enabled paper mills are flooding academic literature with automated fraudulent research at a pace peer review cannot match; the journals most affected are the ones where AI tools also do most of the checking. When AI enables fraud and AI is supposed to detect it, the feedback loop is not self-correcting. Groundy covers the governance implications—for publishers, for researchers, and for AI training pipelines that ingest scientific literature.

Featured in this cluster

Latest in Ethics, Policy & Safety

Newest first
01

Maryland Enacts First US Ban on Algorithmic Grocery Pricing, Effective Immediately

Governor Moore signed Maryland's Protection From Predatory Pricing Act on April 28, making it the first US state law to ban AI-based grocery pricing, effective immediately.

02

Connecticut SB 5 Passes May 1: AI Provenance, AEDT Disclosures, and Chatbot Guardrails by 2027

Connecticut SB 5 requires provenance for large generative platforms by October 2026, AEDT disclosures for HR tools, and companion-chatbot guardrails for minors by January.

03

EU Commission's May 8 Article 50 Draft Guidelines Pin AI Disclosure to an 'Average Consumer' Test

The EU Commission's May 8 draft guidelines set an 'average consumer' standard for AI disclosure exemptions under Article 50, with a multi-factor vulnerable-group test that.

04

Frontier AI Broke Open CTFs: What Hack The Box and BearcatCTF 2026 Results Mean for Security Hiring Signals

Frontier AI now ranks in the top 5% of CTFs, eroding leaderboards as a security hiring signal and forcing organizers toward bans, hybrid scoring, or AI-only divisions.

05

Frontier AI Has Broken the Open CTF Format: What the Scoreboard Collapse Means for Security Training

Frontier AI now autonomously solves medium and hard CTF challenges, collapsing open scoreboards as a measure of human skill and threatening the pipeline for security talent.

06

FTC's TAKE IT DOWN Act Lands May 19: 48-Hour Deepfake NCII Takedowns and No Safe Harbor

Ferguson's May 11 warning letters put 15 UGC platforms on notice as Section 3 of the TAKE IT DOWN Act activates May 19, requiring 48-hour NCII removal with no safe harbor.

07

Salesforce Spring '26 Reveals a Default-On AI Training Setting That Predates the Atlassian Backlash

Salesforce's Spring '26 toggle surfaced a default-on AI training posture dating to 2018, joining GitHub and Atlassian in a spring wave that shifts privacy burden to buyers.

08

White House Drafts FDA-Style Pre-Release Vetting for Frontier AI After Anthropic's Mythos Disclosure

The White House is studying FDA-style pre-release vetting for frontier AI after Anthropic's Mythos disclosure, but a fast walkback and internal feud have left policy in limbo.

09

California SB 1119 and AB 2023 Cleared Committee April 21: Companion Chatbots Owe Annual AG-Filed Audits

California companion-chatbot bills advanced in April 2026, mandating annual AG-filed audits, hard usage caps for minors, and per-child civil liability.

10

Citizen Lab Names Three Telcos as Persistent Entry Points for Commercial SS7 Surveillance Vendors

Citizen Lab names 019Mobile, Tango Networks, and Airtel Jersey as persistent entry points for commercial SS7 surveillance vendors, shifting accountability to named carriers.

11

Symbolic Guardrails for AI Agents: Hard Safety Guarantees Without Crippling Capability

A new paper shows symbolic guardrails can push agent safety to 100% in regulated domains without capability loss — but only for 74% of real-world policies.

· 6 min read
12

America's AI Researcher Pipeline Dropped 89%. What the Stanford Index Means for Teams Hiring AI Engineers

Stanford's 2026 AI Index reports an 89% collapse in AI researcher inflows to the US. Here's what it means for teams actively building AI engineering capacity.

· 6 min read
13

Atlassian Turned On AI Training Data Collection by Default — Here's What to Disable

Atlassian's data contribution policy sends Jira and Confluence content to AI training by default. Here's the exact settings path to opt out before August 17.

· 6 min read
14

Stanford's 2026 AI Index: Frontier Model Transparency Scores Collapsed 31% in One Year

The 2025 FMTI found average transparency scores dropped from 58 to 40 in a single year. Here's what that means for auditors and responsible deployment.

· 6 min read
15

The AI Grief Split: When Emotional Bonds with Language Models Break

People form real emotional bonds with AI companions. When models update or shut down, users experience genuine grief—a psychological and ethical crisis point.

· 10 min read

Explore More Categories

Discover insights across different technology domains.

Browse All Articles