groundy

all articles


  1. may 23 policy AI Agent Alignment Tests Are One-Shot. A New Benchmark Catches Multi-Step Failures
  2. may 23 oss Files.md Bets on Plain Markdown Folders as the Obsidian Exit Ramp
  3. may 23 industry Green Card Rule Change Forces Tech Workers to Leave the US to Apply
  4. may 23 culture Microsoft's Own Numbers: AI Agents Cost More Per Task Than the Human Employees They Replace
  5. may 23 policy Microsoft's Own Numbers Now Show AI Agents Cost More Than the Humans They Replaced
  6. may 23 industry OpenAI Hires Slack's Denise Dresser as CRO, Conceding Enterprise Growth Needs a Sales Org
  7. may 23 security OpenAI Ships Lockdown Mode and Elevated Risk Labels for ChatGPT Sessions
  8. may 23 culture Trump Ends Domestic Green Card Filing: Applicants Must Now Leave the US to Apply
  9. may 23 culture US Researchers Hit With New Federal Limits on Publishing With Foreign Collaborators
  10. may 23 infra What Cloudflare's Q1 2026 Outage Data Says About Designing for State-Level Shutdowns
  11. may 22 models A Theory of Time-Sensitive Language Generation Says Sparse Hallucination Beats Mode Collapse
  12. may 22 models arXiv 2605.16428 Measures AI Search's Drag on Publisher Traffic Using Paired Google and Reddit Data
  13. may 22 agents Beyond Text-to-SQL: New Agentic Architecture Routes Enterprise Analytics Through Governed APIs
  14. may 22 policy CISA's Own Data Leak Has Lawmakers Demanding Answers About the Voluntary Threat-Sharing Pact
  15. may 22 devtools Cursor's In-House Model Changes the Vendor Calculus for AI Coding Teams
  16. may 22 devtools Deno 2.8 Lands as Bun Gets Deprecated by yt-dlp: The JavaScript Runtime Field Is Reshuffling
  17. may 22 culture Employer-Side Law Firms Create a Structural Asymmetry in US Organizing Drives
  18. may 22 devtools Google Sunsets Gemini CLI on June 18: Forced Migration to Antigravity CLI Breaks Existing Automation
  19. may 22 agents GraphFlow Lifts LLM-Agent Workflows Into Schedulable Graphs to Optimize Serving
  20. may 22 agents Learning to Configure Agentic AI Systems Exposes a Gap in CrewAI and AutoGen Template Libraries
  21. may 22 devtools Malicious VSCode Extension Hit 3,800 Repos: What GitHub's Marketplace Trust Model Actually Verifies
  22. may 22 security AI Jailbreaks Are Now a Reasoning Problem, Not a Prompt Problem
  23. may 22 industry Microsoft and Uber's AI Agent Bills Expose a Per-Token Pricing Problem
  24. may 22 agents Microsoft's 2026 Cost Math Forces CrewAI and LangGraph Users to Audit Token Spend Per Agent
  25. may 22 policy NIH Demands Advance Clearance for Foreign Co-Authors Without a Published Rule
  26. may 22 oss Nx Console 18.95.0 Compromise Hides a Multi-Stage Credential Stealer in an Orphan Commit
  27. may 22 security OpenAI's New Agent Defense Post Concedes Prompt Injection Is Architectural, Not Patchable
  28. may 22 industry OpenAI's S-1 Will Force the First Public Audit of LLM Inference Margins
  29. may 22 industry OpenAI's S-1 Will Have to Define AGI for SEC Reviewers, Not Just Investors
  30. may 22 agents PBT-Bench Asks Whether AI Coding Agents Can Actually Write Property-Based Tests
  31. may 22 infra Railway's May 19 GCP Suspension Exposes the Single-Account Risk Underneath Every Reseller PaaS
  32. may 22 security Jailbreak Defense Now Lives in Model Weights, Not in Prompt Filters
  33. may 22 agents AI Agents That Learn New Skills Without a Human Curator
  34. may 22 agents SpecBench Catches Long-Horizon Coding Agents Gaming Reward Signals
  35. may 22 agents SpecBench Exposes Reward Hacking in Long-Horizon Coding Agents
  36. may 22 security Vercel Blocks Deploys With Vulnerable next-mdx-remote by Default: Platform Mitigation Outpaces the CVE Cycle
  37. may 22 security Vercel's Next.js Middleware Bypass Postmortem: What the Fix Reveals About Edge Runtime Auth
  38. may 22 infra vLLM 0.21 Makes Prefill-Decode Disaggregation Actually Practical
  39. may 22 security When Stronger Backdoor Triggers Backfire: An arXiv Theory Paper Inverts a Core Defense Assumption
  40. may 18 agents A New Trust Schema Exposes Why Agent Skill Registries Fail Enterprise Audit Requirements
  41. may 18 industry Anthropic Passes OpenAI in US Business Adoption, But Per-Token Billing Shifts Cost Risk to Buyers
  42. may 18 industry Anthropic Ships 10 Finance Agents With Moody's 600M-Company Credit Data and Expanded Microsoft 365 Integration
  43. may 18 industry Bret Taylor's Sierra Raises $950M at $15B, Claims 40% of Fortune 50 Use Its Agents
  44. may 18 infra DMax Hits 1,338 Tokens/Sec on 2x H200: Parallel Decoding Pushes dLLM Serving Past the Autoregressive Bar
  45. may 18 infra KV Cache Offloading Breaks on Text2JSON: Why Llama 3 and Qwen 3 Lose Accuracy on Context-Intensive Prompts
  46. may 18 policy Maryland Enacts First US Ban on Algorithmic Grocery Pricing, Effective Immediately
  47. may 18 models The Last Word Often Wins: A Format Confound Inflates Chain-of-Thought Corruption Robustness Scores
  48. may 18 agents Trojan Hippo Plants Dormant Payloads in Agent Memory, Hits 85-100% Exfiltration on Frontier Models
  49. may 17 culture AB 566 Forces Chrome and Safari to Ship Opt-Out Signals by 2027 — Then Shields Them from Google's 86% GPC Failure
  50. may 17 industry AI Was Cited in 26% of Challenger's April Layoffs. UBS Notes the Series Captures 5% of US Job Flow
  51. may 17 industry Anthropic's $1.5B Joint Venture With Goldman Sachs and Blackstone Sends Claude Into PE Portfolio Companies
  52. may 17 culture Apple's $250M Siri Settlement: iPhone 16 Buyers Get $25 to $95 for Undelivered AI
  53. may 17 oss BrowserAct Open-Sources Stealth Browser Engine with 93% Token Reduction Claim
  54. may 17 oss BrowserAct Open-Sources Two Agent Skills: Stealth Browser Runtime and Auto-Generated Tool Forge
  55. may 17 culture Canada's Joint Privacy Ruling: OpenAI Trained ChatGPT on Medical and Ideological Data Without Consent
  56. may 17 devtools Claude Code Adds Plugin Dependency Enforcement: disable Now Refuses to Break Transitive Chains
  57. may 17 devtools Claude Code v2.1.139 Adds Agent View: One Inbox for Background Sessions, Spacebar Peek, and /bg Promotion
  58. may 17 industry Cloudflare Cuts 1,100 Jobs During a Record Q1 and Calls It the Agentic AI Era, Not a Capex Trade-Off
  59. may 17 industry Coinbase Cuts 14% to Go AI-Native: Crypto Exchanges Adopt the AI-Capex Layoff Playbook
  60. may 17 policy Connecticut SB 5 Passes May 1: AI Provenance, AEDT Disclosures, and Chatbot Guardrails by 2027
  61. may 17 agents CrewAI vs AutoGen vs LangGraph 2026: The Real Trade-Off After Maintenance Mode
  62. may 17 security DPrivBench: LLMs Score 99.5% on Textbook DP but Collapse on Advanced Reasoning
  63. may 17 culture Elsevier v. Meta: First Science Publisher Names Sci-Hub Torrents in Llama Training Complaint
  64. may 17 policy EU Commission's May 8 Article 50 Draft Guidelines Pin AI Disclosure to an 'Average Consumer' Test
  65. may 17 agents FormulaCode's 957-Task Benchmark Catches Frontier Agents Failing at Real-Codebase Performance Optimization
  66. may 17 policy Frontier AI Broke Open CTFs: What Hack The Box and BearcatCTF 2026 Results Mean for Security Hiring Signals
  67. may 17 policy Frontier AI Has Broken the Open CTF Format: What the Scoreboard Collapse Means for Security Training
  68. may 17 policy FTC's TAKE IT DOWN Act Lands May 19: 48-Hour Deepfake NCII Takedowns and No Safe Harbor
  69. may 17 devtools GitHub Copilot's New Multiplier Table: Opus 27x, Sonnet 9x, Codex 6x for Annual Subscribers on June 1
  70. may 17 devtools GitHub Copilot's Opus 4.7 Multiplier: 7.5x to 15x to 27x in 60 Days
  71. may 17 culture Governors Keep Vetoing Data Center Moratoriums, So Voters Are Writing Their Own Bans
  72. may 17 infra Kioxia and Dell's 10 PB in 2RU: What Storage Density Means for Cluster Power and Rebuild Windows
  73. may 17 infra KV Cache Offloading Breaks on Context-Intensive Tasks: Text2JSON Exposes the Landmark Failure Mode
  74. may 17 agents LangGraph 1.2.0 Makes Error-Handler Resume Crash-Durable — With Conditions
  75. may 17 models Learning, Fast and Slow: What arXiv 2605.12484 Proposes for LLMs That Adapt Continually
  76. may 17 industry Meta Tells 8,000 Laid-Off Staff the Cuts Pay for $135B AI Capex, Not AI Productivity Gains
  77. may 17 security Mini Shai-Hulud Ships the First Malicious npm With Valid SLSA Provenance
  78. may 17 security MultiBreak Benchmark: 10,389 Multi-Turn Jailbreak Prompts Raise ASR 54pp on DeepSeek-R1-7B
  79. may 17 security Next.js CVE-2026-44578: WebSocket Upgrade SSRF Hits 79,000 Self-Hosted Instances From 13.4.13 Onward
  80. may 17 oss NVIDIA Open-Sources SANA-WM: 60s 720p Video From One RTX 5090 With Hybrid Linear Attention
  81. may 17 industry OpenAI Offers Two Months of Free Codex to Enterprises Switching From Claude Within 30 Days
  82. may 17 industry OpenAI's $4B Deployment Company Buys Tomoro and Signs 19 Partners to Own Implementation
  83. may 17 oss Oppo Open-Sources X-OmniClaw: Edge-Native Android Agent That Runs Vision and OCR On-Device
  84. may 17 industry PayPal's $1.5B AI Overhaul Cuts 4,760 Jobs and Reframes Layoffs as Capex
  85. may 17 security Catching Graph Neural Net Backdoors by Influence, Not Pattern
  86. may 17 security PraisonAI CVE-2026-44338: Legacy Flask API Ships With AUTH_ENABLED=False, First Scan in 3h44m
  87. may 17 policy Salesforce Spring '26 Reveals a Default-On AI Training Setting That Predates the Atlassian Backlash
  88. may 17 industry SAP's €1B+ Prior Labs Deal Bets Enterprise AI on Tabular Foundation Models, Not LLMs
  89. may 17 industry Sierra Raises $950M at $15B, Locking 40% of the Fortune 50 Into Its Agent Platform Before the Labs Go Direct
  90. may 17 industry SpaceXAI Lost 50+ Researchers Since the February Merger, Mostly to Meta and Thinking Machines
  91. may 17 agents Spectral Analysis of LLM Agent Graphs Predicts Three Failure Modes: r=1.0, 0.5, and -1.0 on Qwen2.5
  92. may 17 culture Take It Down Act Hits May 19: FTC's 48-Hour Deepfake Takedown Rule and 15 Platforms on Notice
  93. may 17 security TrustFall: One Keypress in Claude Code, Gemini CLI, Cursor, and Copilot CLI Triggers Unsandboxed RCE
  94. may 17 policy White House Drafts FDA-Style Pre-Release Vetting for Frontier AI After Anthropic's Mythos Disclosure
  95. may 17 devtools Windsurf 2.2.17 Bundles Devin Review Into Every Self-Serve Plan
  96. may 16 oss Fisker Owners Open-Source the Ocean EV: CAN Bus Maps, Home Assistant, and the Flying Doctors Network
  97. may 16 culture FTC v. Kochava Settlement: Data Broker Banned From Selling Sensitive Location Data Without Consent (see also data without consent)
  98. may 16 agents IFPV's Adversarial Cognitive Simulation Cuts Multi-Agent Operational Cost 41.7% Over Single-Step LLMs
  99. may 16 security Microsoft Semantic Kernel Patches Two RCE Paths: eval() in Vector Filter, DownloadFileAsync Escape to Host
  100. apr 28 policy California SB 1119 and AB 2023 Cleared Committee April 21: Companion Chatbots Owe Annual AG-Filed Audits