Groundy — independent coverage of developer tools, infrastructure, and platforms
Web Agents Can Be Talked Into Abandoning Their Task: The TRAP Benchmark
The TRAP benchmark finds 13 to 43 percent of web agent tasks can be redirected by persuasive page content, exposing a blind spot in current instruction-hierarchy defenses.
securityShallow Neural Nets Beat LLM Guardrails at Catching Prompt Injection
GuardNet's 47M-parameter BiLSTM ensemble detects prompt injections in 50 ms on CPU, but 0.747 blind-benchmark AUROC and classifier-evasion risks leave the arms race.
When an AI Agent Clicks a Link: OpenAI's Data-Exfiltration Model
OpenAI's URL provenance filter concedes content inspection is intractable. Agents that mix sensitive data with web access face a structural exfiltration risk.
agentsWhy Foundation Model Agents Pass Benchmarks but Fail in Production
A June 2026 paper frames the AI agent benchmark gap as a sim-to-real problem, giving eval teams a four-part MDP checklist to challenge vendor claims before live deployment.
industryVercel's Rox Case Study Pitches AI Agents as a Revenue Operating System
Vercel's shift from frontend hosting to agent infrastructure is backed by real products and a $9.3B valuation. Whether per-token billing beats per-seat SaaS remains unproven.
industryAI Patent Valuation Models Aim to Replace the Expert Appraiser
A new framework decomposes patent value into per-feature Shapley credits, but courts have not ruled on whether model output replaces expert testimony in damages and M&A.
policyData Safety Policies for AI Agents: Controlling What an Agent Can Leak
A June 2026 paper proposes Data Flow Control, moving agent data safety from prompt-level guardrails to deterministic, auditable SQL query policies enforced outside the model.
agentsCan AI Agents Repair Broken Network Configs? A New Benchmark Tests It
LLM agents with formal verification repair 12% more network misconfigurations than base models and are 17% safer, but regress on large topologies, limiting production use.
- models Opus 4.8 vs Opus 4.7: What Changed and What Did Not
- agents Claude Code, Cursor, Copilot: How Agentic Coding Assistants Get Weaponized as Attacker Shells
- devtools Anthropic Buys Stainless: OpenAI and Google Now Depend on a Rival for SDK Tooling
- agents A New Trust Schema Exposes Why Agent Skill Registries Fail Enterprise Audit Requirements
- policy FTC's TAKE IT DOWN Act Lands May 19: 48-Hour Deepfake NCII Takedowns and No Safe Harbor
- devtools GitHub Copilot vs Cursor vs Claude Code: The 2026 AI Coding Showdown
- infra MLX vs llama.cpp on Apple Silicon: Which Runtime to Use for Local LLM Inference
- models AI Code Generation Benchmarks 2026: Which Model Actually Writes Better Code?
- devtools Claude Code in GitHub Actions: A Complete Guide to Automated PR Fixes
- devtools Claude Code Plugins: Anthropic's Official Plugin Ecosystem Explained
- models Chinese AI Models Compared: DeepSeek, Qwen, Kimi, Doubao, and Ernie
- devtools GitHub Copilot's Opus 4.7 Multiplier: 7.5x to 15x to 27x in 60 Days
- devtools GitHub Copilot Replaces Premium Request Units With Token-Metered AI Credits on June 1
- culture EU's 2027 Replaceable Battery Mandate: What It Means for Phone Buyers and Repairers Right Now
- industry Cursor's Meteoric Rise: Inside the AI Editor Hitting $300M ARR
- jun 07 security Web Agents Can Be Talked Into Abandoning Their Task: The TRAP Benchmark
- jun 07 security Shallow Neural Nets Beat LLM Guardrails at Catching Prompt Injection
- jun 07 security When an AI Agent Clicks a Link: OpenAI's Data-Exfiltration Model
- jun 07 agents Why Foundation Model Agents Pass Benchmarks but Fail in Production
- jun 07 industry Vercel's Rox Case Study Pitches AI Agents as a Revenue Operating System
- jun 07 industry AI Patent Valuation Models Aim to Replace the Expert Appraiser
- jun 06 policy Data Safety Policies for AI Agents: Controlling What an Agent Can Leak
- jun 06 agents Can AI Agents Repair Broken Network Configs? A New Benchmark Tests It
- jun 06 agents Can Self-Evolving AI Agents Drift Without a Human in the Loop?
- jun 06 culture A Covert LLM Persuasion Experiment Was Shut Down: How Far Did the Bots Get?
- jun 06 infra Indexing Images for RAG: kapa.ai's Approach to Multimodal Retrieval
- jun 06 models Can LLMs Leak Training Data? A New Test Splits Capacity From Intent
- jun 06 policy GDPR Rectification Rights Have No Clear Owner in ML Supply Chains
- jun 06 security Benchmarking RAG Over Cyber Threat Intelligence: Where Retrieval Breaks
- jun 06 models When an AI Agent's Tools Break, Can It Recover? A New Benchmark
- jun 06 industry US Hyperscale Data Centers: A Carbon Audit That Recasts AI Power Costs
- jun 05 infra The RTX Spark Bet on Unified Memory for Local LLMs: Where Bandwidth Caps It
- jun 05 infra Reading Vercel's Fluid Compute vs Cloudflare Workers Benchmark
- jun 05 agents Fine-Tuning Multi-Agent LLM Systems: RL Enters Where Prompt Tweaks Stall
- jun 05 security Stronger Safety Alignment Made LLMs Easier to Jailbreak, Not Harder
- jun 05 security SAML Signature Bypass Is Back: Inside the SAMLStorm Vulnerability Class
- jun 05 policy When LLM Safety Lives at Inference, Not Training: A Certification Gap
- jun 05 culture Do LLMs Understand Idioms in Low-Resource Languages?
- jun 05 infra Does CUDA Tile Match Hand-Tuned Kernels on Hopper and Blackwell?
- jun 05 security SAMLStorm: The SAML Signature Bug That Forges Valid SSO Logins
- jun 05 models MiniMax M3 Bets on Sparse Attention for 1M Context. Does the Math Hold?
- jun 05 models Can One Model Handle Every CAD Task? UniCAD Tests It
- jun 05 models Do Foundation Models Actually Learn Relational Structure In-Context?
- jun 05 models Can LLMs Write Better Research Paper Titles Than Authors?
- jun 05 models Does Information-Theoretic Example Selection Beat kNN for In-Context Learning?
- jun 05 infra Pod-Level Remote Attestation in Kubernetes: Confidential Workloads on dstack
- jun 05 models Do Concept Bottleneck Model Benchmarks Measure Interpretability or Dataset Bias?
- jun 05 agents Cascading Hallucination in Agentic RAG: When One Bad Retrieval Poisons the Chain
- jun 05 security Vercel's Flags SDK Exposed Feature-Flag Definitions via CVE-2025-46332
- jun 05 models Continuous Bit-Width Quantization vs Fixed INT4: Does LiftQuant Beat Discrete?
- jun 04 models Federated Learning for Industrial IoT Anomaly Detection: The Data-Locality Tradeoff
- jun 04 infra Generating GPU Kernels for Moore Threads Silicon: Can LLMs Break CUDA Lock-In?
- jun 04 devtools Alibaba's Open Code Review Moves AI Review Into the CLI, Not the PR
- jun 04 infra Microsoft's Azure Linux Goes General-Purpose: The Container Base-Image Play
- jun 04 models Reading Failed LLM Reasoning Traces Won't Tell You Which Ones RL Can Fix
- jun 04 agents Can AI Agents Build Other Agents? The Meta-Agent Challenge Says Mostly Not Yet
- jun 04 models Can You Stitch Two Foundation Models Together Without Retraining?
- jun 04 infra Cloudflare Acquires VoidZero, the Company Behind Vite's Rust Toolchain
- jun 04 security Jailbreak Suffixes Hit Harder at Specific Token Positions, New GCG Variant Shows
- jun 04 policy When Should an LLM Forget You? A Benchmark for Deciding What Memory to Drop
- jun 04 security OpenAI Adds Lockdown Mode to ChatGPT, Shifting Prompt-Injection Risk to Users
- jun 04 policy When RL Training Rewards Capability-Seeking: A New Alignment Risk
- jun 04 models Do Reasoning LLMs Waste Tokens? OckBench Tries to Measure It
- jun 04 security Activation Steering Was Sold as LLM Control. New Work Makes It an Attack Surface
- jun 04 culture Can Teaching Logical Fallacies Inoculate People Against AI Misinformation?