Groundy — independent coverage of developer tools, infrastructure, and platforms
OpenAI's TanStack npm Writeup Shifts Dependency-Control Burden onto AI Tooling Teams
OpenAI's TanStack npm writeup is its second macOS signing compromise within a month, and it raises the dependency-control bar for every team pulling npm into AI tooling.
devtoolsVercel Detects Bun Lockfiles for Affected Builds as Text bun.lock Stabilizes
Vercel now detects Bun lockfiles to skip untouched monorepo builds, and Bun v1.2 defaults to a text bun.lock that diffs in git, so teams can retire the binary bun.lockb.
Apple Raises Mac and iPad Prices as AI Memory Demand Drains DRAM Supply
Apple's Mac and iPad price hikes trace back to AI demand draining DRAM supply: HBM stacks now consume the same wafers, leaving every memory-heavy device carrying an AI tax.
infraVercel's Anti-Lock-In Pitch: What the Open-Source Bet Still Locks In
Vercel markets Next.js and the AI SDK as open source and portable, but the paid platform, from deploy previews to Fluid Compute, does not travel with the code.
ossEmotion Vectors Replicate in Open-Source LLMs, but Steering Is Unproven
A June 2026 preprint shows the open-weight models Apertus-8B and Gemma-4-E4B encode emotion vectors at r=0.76 to 0.83, but does not prove steering controls behavior.
modelsDoes Tree-of-Thought Reasoning Scale to Billion-User Modeling?
ScaleToT distills tree-of-thought reasoning into a profile encoder that serves billion-user recommenders without per-user LLM calls, lifting LT30 by 6.738% in an A/B test.
agentsDo AI Agents Hold Up Outside Familiar Environments? A New Eval Says No
A 100-task benchmark finds the frontier AI agent clears 19.1% of vision-heavy tasks where non-experts top 80%. Leaderboard scores don't transfer to deployment.
infraVercel Adds Tag-Based CDN Cache Invalidation: Surrogate Keys at the Edge
Vercel's January 2026 Vercel-Cache-Tag ship brings surrogate-key cache invalidation to every plan, moving the cache contract into application-owned tag strings.
- agents MCP vs A2A: Two Agent Protocols, One Integration Layer Decision
- models GLM-5.2 vs Kimi K2.7 Code: Two Open-Weight Bets on Agentic Coding
- devtools Cursor Goes to SpaceX, Windsurf to Cognition: What Changes for Dev Teams
- policy US Export Order Forces Anthropic to Disable Fable 5 and Mythos 5 Worldwide
- infra MiniMax M3 Ships 1M Context and Desktop Control as Open Weights
- devtools GitHub Copilot vs Cursor vs Claude Code: The 2026 AI Coding Showdown
- industry Cursor's Meteoric Rise: Inside the AI Editor Hitting $300M ARR
- models GLM-5.2 Benchmarks: What 62.1% SWE-bench Pro and 99.2% AIME Actually Mean
- models AI Code Generation Benchmarks 2026: Which Model Actually Writes Better Code?
- models Chinese AI Models Compared: DeepSeek, Qwen, Kimi, Doubao, and Ernie
- infra Running GLM-5.2 at Home: SGLang, vLLM, Transformers, and KTransformers Setup Guide
- industry Fable 5 Credit Cliff: What the June 23 Billing Shift Means for Teams
- devtools Claude Code Plugins: Anthropic's Official Plugin Ecosystem Explained
- devtools Running GLM-5.2 in Cursor, Cline, and Roo Code: Migration Checklist and Gotchas
- devtools Cursor Goes to SpaceX, Windsurf to Cognition: What Changes for Dev Teams
- jun 25 security OpenAI's TanStack npm Writeup Shifts Dependency-Control Burden onto AI Tooling Teams
- jun 25 devtools Vercel Detects Bun Lockfiles for Affected Builds as Text bun.lock Stabilizes
- jun 25 industry Apple Raises Mac and iPad Prices as AI Memory Demand Drains DRAM Supply
- jun 25 infra Vercel's Anti-Lock-In Pitch: What the Open-Source Bet Still Locks In
- jun 25 oss Emotion Vectors Replicate in Open-Source LLMs, but Steering Is Unproven
- jun 25 models Does Tree-of-Thought Reasoning Scale to Billion-User Modeling?
- jun 25 agents Do AI Agents Hold Up Outside Familiar Environments? A New Eval Says No
- jun 25 infra Vercel Adds Tag-Based CDN Cache Invalidation: Surrogate Keys at the Edge
- jun 25 agents How Much Repo Structure Does a Coding Agent Actually Need?
- jun 25 agents MCP vs A2A: Two Agent Protocols, One Integration Layer Decision
- jun 25 oss Open-Source AI Adoption Index Uses Chat Logs and O*NET Data to Replicate Frontier-Lab Studies
- jun 25 infra GLM 5.2 Fast on Vercel AI Gateway: What Routing Through Wafer Actually Buys
- jun 25 industry OpenAI Pushes Its IPO Into 2027, Clearing the Lane for Anthropic's S-1
- jun 25 infra Vercel CDN Cache Tags vs Path Purging: When Tag Invalidation Wins
- jun 25 infra Prisma Joins the Vercel Marketplace: The ORM Becomes the Database Vendor
- jun 25 security OpenAI's ChatGPT Atlas Treats Prompt Injection as Unfixed, Not Patched
- jun 25 devtools Vercel CLI Now Signs Blob URLs: Moving Access Control Off the App Server
- jun 25 oss OpenKnowledge Keeps Markdown Local but Routes the Vault to Cloud Coding Agents
- jun 25 models Can LLMs Debug Verilog? VeriPilot Puts an Agent on RTL Errors
- jun 25 devtools Buying Domains From the Vercel CLI: What Domain Search Folds Into Deploys
- jun 25 infra OpenAI on AWS Bedrock: Routing Math to Run Before You Move Traffic
- jun 24 infra Vercel's Function Observability: What Native Metrics Replace and What They Don't
- jun 24 infra AWS Databases on the Vercel Marketplace: The Cross-Cloud Latency Tax
- jun 24 agents Can You Rewind an AI Agent Mid-Run? Reversible Traces Say Yes
- jun 24 models Task Decomposition Helps LLMs by Shrinking Output Space, Not by Cutting Labeling Cost
- jun 24 agents Can AI Agents Reproduce Published Research? CORE-Bench Tests It
- jun 24 security Can Provable Bounds Defend LLM Fine-Tuning Against Poisoned Data?
- jun 24 devtools Yarn Berry on Vercel: A Build-Cache Gap With No Documented Fix
- jun 24 infra Turso on the Vercel Marketplace: Edge SQLite vs the Serverless Connection Pool
- jun 24 devtools SvelteKit Can Run NextAuth.js, but Auth.js Moved to Better Auth
- jun 24 agents How On-Device AI Agents Keep Learning by Forgetting on Purpose
- jun 24 devtools Fired for Building the Google Workspace CLI: The Risk of Depending on Unofficial Vendor Tools
- jun 24 models Flow Matching vs U-Net: A Skip-Free Backbone for Speech Models
- jun 24 security Measuring LLM Safety by Refusal Alignment Instead of Attack Success Rate
- jun 24 security Poisoning Physics-Informed Neural Networks Slips Past Loss-Based Validation
- jun 24 policy 50 Years of Aviation Certification Expose a Structural Gap in AI Governance
- jun 24 security Catching LLM Jailbreaks by Watching Per-Layer Entropy, Not Outputs
- jun 24 oss Cost and Access, Not Ideology, Drive Open-Weight Chinese Model Adoption
- jun 24 models A Per-Neuron Sequence Model Was Withdrawn From arXiv as Coverage Hailed It
- jun 24 policy Do Reasoning Tokens Actually Make LLMs Safer? A New Paper Tests It
- jun 24 devtools Nub Bundles a Bun-Style Toolkit Onto Node Without the Runtime Swap
- jun 24 oss Bot-Account Lookups Miss 97% of AI Coding Agent Commits, 180M-Repo Census Finds
- jun 24 security How Reliable Are the LLM Judges Scoring Jailbreak Attacks?
- jun 24 models PV-TAM Corrects Decoding Drift and Boundary-Marker Bias in VLM Localization Scoring
- jun 24 agents Do AGENTS.md Files Actually Help Coding Agents? A New Benchmark Tests It
- jun 24 agents Should AI Shopping Agents Pay Micro-Transactions for Verified Product Data?
- jun 24 models Meituan's General 365 Benchmark: Top Models All Score Under 63%
- jun 24 models LLM Surrogates in A/B Tests: The 39% Recovery Gap and the Silent Bias Risk
- jun 24 models LLM Token Pricing vs Compute Cost: What the Tokenomics Math Shows
- jun 24 models Do LLM Judges Favor Their Own Output? A Sanity Check on Self-Preference