Groundy — independent coverage of developer tools, infrastructure, and platforms
Penetration Testing Multi-Agent LLM Systems: A Failure Catalog Vendors Don't Document
The first independent pen tests of proprietary agent deployments found preventable classical vulnerabilities, not novel AI flaws, compounding across multi-agent topologies.
securityOpenAI's New Safety Bug Bounty Pays Researchers for Jailbreaks and Policy Bypasses
OpenAI's safety bounties create a vendor-controlled disclosure market where NDAs silence participants, payouts trail serious red-team costs, and open publication has no lane.
One Learning Rate Doesn't Fit All: Heavy-Tail Layerwise LR Schedules for LLM Pretraining
LLR assigns per-layer learning rates from spectral heavy-tail diagnostics during LLM pretraining, achieving 1.5x faster convergence and up to 2 pp higher zero-shot accuracy.
industryOpenAI Buys Statsig and Makes Vijaye Raji CTO of Applications: Product Analytics Becomes Core Infra
OpenAI's $1.1B Statsig deal makes experimentation infrastructure a strategic asset in the AI vertical integration race, pressuring LaunchDarkly and Amplitude.
securityAxios npm Compromise Forces Vercel Into Platform-Level Remediation
When compromised axios npm versions carried a North Korean RAT, Vercel blocked C2 egress at the deploy layer because the npm registry did not verify OIDC provenance.
industryHuggingFace's $100M Series C Bets Open-Source AI Can Outlast Per-Token Pricing Wars
HuggingFace's $100M Series C funds an open-weights infrastructure stack designed to let enterprises avoid escalating per-token API costs from closed-model providers.
securityNext.js Dev Server CVE-2025-48068: Any Web Page Could Read Your Source Files
CVE-2025-48068 lets any webpage read source files from a running Next.js dev server via cross-origin script inclusion, exposing secrets loaded in .env files.
industryVercel's Series F Repackages Frontend Hosting as an AI Cloud Bundle
Vercel's Series F funded an AI middleware stack whose SDK, gateway, and runtime create switching costs, raising the feature bar for rival hosting platforms to stay.
- agents Claude Code, Cursor, Copilot: How Agentic Coding Assistants Get Weaponized as Attacker Shells
- devtools Anthropic Buys Stainless: OpenAI and Google Now Depend on a Rival for SDK Tooling
- agents A New Trust Schema Exposes Why Agent Skill Registries Fail Enterprise Audit Requirements
- policy FTC's TAKE IT DOWN Act Lands May 19: 48-Hour Deepfake NCII Takedowns and No Safe Harbor
- agents CrewAI vs AutoGen vs LangGraph 2026: The Real Trade-Off After Maintenance Mode
- devtools Claude Code Plugins: Anthropic's Official Plugin Ecosystem Explained
- devtools GitHub Copilot vs Cursor vs Claude Code: The 2026 AI Coding Showdown
- infra MLX vs llama.cpp on Apple Silicon: Which Runtime to Use for Local LLM Inference
- models Chinese AI Models Compared: DeepSeek, Qwen, Kimi, Doubao, and Ernie
- models AI Code Generation Benchmarks 2026: Which Model Actually Writes Better Code?
- infra Prefill-Decode Disaggregation: The Architecture Shift Redefining LLM Serving at Scale
- devtools Claude Code in GitHub Actions: A Complete Guide to Automated PR Fixes
- industry Cursor's Meteoric Rise: Inside the AI Editor Hitting $300M ARR
- devtools GitHub Copilot's Opus 4.7 Multiplier: 7.5x to 15x to 27x in 60 Days
- culture EU's 2027 Replaceable Battery Mandate: What It Means for Phone Buyers and Repairers Right Now
- may 26 agents Penetration Testing Multi-Agent LLM Systems: A Failure Catalog Vendors Don't Document
- may 26 security OpenAI's New Safety Bug Bounty Pays Researchers for Jailbreaks and Policy Bypasses
- may 26 models One Learning Rate Doesn't Fit All: Heavy-Tail Layerwise LR Schedules for LLM Pretraining
- may 26 industry OpenAI Buys Statsig and Makes Vijaye Raji CTO of Applications: Product Analytics Becomes Core Infra
- may 26 security Axios npm Compromise Forces Vercel Into Platform-Level Remediation
- may 26 industry HuggingFace's $100M Series C Bets Open-Source AI Can Outlast Per-Token Pricing Wars
- may 26 security Next.js Dev Server CVE-2025-48068: Any Web Page Could Read Your Source Files
- may 26 industry Vercel's Series F Repackages Frontend Hosting as an AI Cloud Bundle
- may 26 infra Gemma 4 31B on Cloud TPU vs GPU: The Serving Cost Crossover Point
- may 26 agents Claude Code, Cursor, Copilot: How Agentic Coding Assistants Get Weaponized as Attacker Shells
- may 26 infra Cloudflare Flagship Is a Feature Flag Service That Deepens Platform Gravity
- may 25 agents Microsoft Bolts Governance Onto Agent Framework as Stack Sprawl Persists
- may 25 policy arXiv Paper Tracks FTC Affiliate Disclosure Gaps in YouTube's Influencer Economy
- may 25 devtools Bun Rewrites Its Core From Zig to Rust, Putting Downstream Zig Bindings at Risk
- may 25 infra ObjectCache Moves KV Reuse to S3-Class Storage: Why Layerwise Retrieval Beats Full-Prefix Cache Hits
- may 25 policy AI Safety Benchmark Rankings Flip Based on Eval Config, SafetyRepro Paper Reports
- may 25 infra Vercel's CDN Origin Timeout Jumps to 2 Minutes: A Concession to LLM Streaming Workloads
- may 25 agents GovernSpec Contractual Skills Make Agent Governance Auditable Before Runtime
- may 25 devtools Vercel Bets on Bun While Post-Acquisition Priority Drift Makes the Runtime a Vendor Decision
- may 25 industry OpenAI Replaces Indeed's Job-Matching Engine: What It Means for ATS Vendors
- may 25 oss One Coding Agent Per Kanban Card: Kanbots Stress-Tests Parallel AI Workflow
- may 25 infra Fluid Compute vs PgBouncer: Vercel's Undocumented Bet on Connection Reuse
- may 25 devtools PromptArmor Shows Microsoft Copilot Cowork Can Be Tricked Into Exfiltrating Files
- may 25 agents Indirect Prompt Injection Benchmarks Were Too Easy: LivePI Adds Realism
- may 25 security Apple Names Claude in CVE Credit Line, Setting Vendor Attribution Precedent
- may 24 industry Vercel Acquires Splitbee to Fold First-Party Analytics Into the Hosting Bundle
- may 24 models Embedding Compression at Training Time: DIVE's Gradient Trick vs Post-Hoc Quantization for Vector DBs
- may 25 devtools Anthropic Buys Stainless: OpenAI and Google Now Depend on a Rival for SDK Tooling
- may 24 models μP Hyperparameter Transfer Has an Embedding Layer Hole, New arXiv Paper Says
- may 25 models Audio LLMs Break When the Codec Changes: A Robustness Vector Voice-AI Teams Haven't Tested
- may 24 policy arXiv 2602.13372 MoralityGym Tests Whether Agents Hold Moral Priorities Across Sequential Decisions
- may 24 devtools Rmux Brings a Playwright SDK to tmux Sessions for Agent Automation Workflows
- may 24 oss Nesbitt's Open Source Death Taxonomy Exposes a Health Score Blind Spot
- may 25 agents Routing LLM Agents: Why TwinRouterBench Splits Static and Live Evaluation
- may 24 infra Vercel Fluid Pools Database Connections Across Invocations, Bypassing External Poolers
- may 23 models Project Glasswing One Month In: AI Bug Discovery Has Outpaced the Patch Pipeline
- may 24 industry SoftBank's $40B Bridge Loan Means Bank Covenants Will Shape OpenAI's Post-IPO Pricing
- may 24 security CISA's Internal Data Leak Tests the Disclosure Standards It Sets for Others
- may 25 infra Railway's GCP Suspension Is a Reseller PaaS Problem, Not a Google One
- may 25 models Do LLMs Know What Not to Say? Causal Evidence for Statistical Preemption
- may 24 security TanStack npm Attack: When OIDC Trusted Publishing Becomes the Attack Vector
- may 24 infra Vercel CDN Request Collapsing: One Origin Fetch Per ISR Cache Miss
- may 25 oss Microsoft Open-Sources the Earliest Known DOS Source Code: What 1980 Tim Paterson 86-DOS Reveals
- may 24 culture OpenAI's Own Economic Analysis Quietly Concedes the Labor Displacement Case
- may 24 security Nx s1ngularity Attackers Used Local Claude Code and Gemini CLI to Steal Developer Tokens
- may 24 infra CISA Admin Leaked AWS GovCloud Keys on GitHub: What Federal Secret Scanning Missed
- may 24 oss Colorado SB051 Carves Out Open Source From Age Verification After Maintainer Backlash
- may 24 oss Colorado SB26-051 Shields Non-Commercial Open Source by Omission, Not by Design
- may 24 devtools Shai-Hulud Returns: 314 npm Packages Compromised in a Self-Propagating Supply-Chain Worm
- may 24 industry OpenAI's S-1 Triggers a Repricing Cascade for Every Private AI Lab Valuation