Groundy — independent coverage of developer tools, infrastructure, and platforms
Zhipu Open-Sources GLM-5.2 Under MIT While Anthropic Tightens Model Access
Zhipu shipped GLM-5.2 with a 1M-token context window the day after the US ordered Anthropic to cut foreign access, with MIT-licensed weights promised within a week.
modelsCan Editing One Neuron Fix LLM Repetition Loops?
A June 2026 preprint localizes Gemma 4 repetition loops to a few MLP neurons and removes them with a one-time weight edit while benchmark scores hold.
Zhipu Ships GLM-5.2 With 1M Context and MIT Weights, but Zero Benchmarks at Launch
Zhipu shipped GLM-5.2 on June 13 with a 1M-token window and an Anthropic-compatible endpoint, but published no benchmarks and keeps the hosted API on a paid plan.
infraAWS Bedrock Now Requires Data Sharing for Mythos: The Self-Hosting Calculus
AWS Bedrock's provider_data_share gate for Mythos-class models removes the in-AWS data boundary regulated teams bought it for, pushing them toward self-hosted serving.
devtoolsVercel's Remend Turns Streaming-Markdown Repair Into a Dependency
Vercel's Remend packages the partial-markdown repair every streaming chat team hand-rolls, but the shared heuristic can mask upstream token-boundary defects.
industryMoonshot's Kimi K2.7 Code Loses 11 of 12 Benchmark Cells, Leads on Efficiency Instead
Moonshot's Kimi K2.7 Code loses 11 of 12 benchmark cells to GPT-5.5 and Opus 4.8, leading on token efficiency and price, which pushes buyers to run their own evals.
policyCan Reinforcement Learning Be Provably Safe Without Sacrificing Scale?
Two June 2026 preprints claim formal safety guarantees hold without a capability tax in low-dimensional robotic control, sharpening the attestation-versus-verification gap.
infravLLM Cold Start Latency: Why Scale-to-Zero LLM Serving Stalls
A June 2026 MLSys paper breaks vLLM cold start into six CPU-bound boot phases, showing why scale-to-zero serving forces operators back into warm GPU pools.
- policy US Export Order Forces Anthropic to Disable Fable 5 and Mythos 5 Worldwide
- infra MiniMax M3 Ships 1M Context and Desktop Control as Open Weights
- agents When AI Agents Delegate Work, Your Observability Stack Goes Blind
- models Claude Fable 5 vs Opus 4.8: When 2x Pricing Is Worth It
- models Opus 4.8 vs Opus 4.7: What Changed and What Did Not
- devtools GitHub Copilot vs Cursor vs Claude Code: The 2026 AI Coding Showdown
- models AI Code Generation Benchmarks 2026: Which Model Actually Writes Better Code?
- industry Cursor's Meteoric Rise: Inside the AI Editor Hitting $300M ARR
- models Chinese AI Models Compared: DeepSeek, Qwen, Kimi, Doubao, and Ernie
- infra MLX vs llama.cpp on Apple Silicon: Which Runtime to Use for Local LLM Inference
- devtools Claude Code Plugins: Anthropic's Official Plugin Ecosystem Explained
- industry OpenAI Offers Two Months of Free Codex to Enterprises Switching From Claude Within 30 Days
- devtools Claude Code in GitHub Actions: A Complete Guide to Automated PR Fixes
- culture EU's 2027 Replaceable Battery Mandate: What It Means for Phone Buyers and Repairers Right Now
- infra Prefill-Decode Disaggregation: The Architecture Shift Redefining LLM Serving at Scale
- jun 15 oss Zhipu Open-Sources GLM-5.2 Under MIT While Anthropic Tightens Model Access
- jun 15 models Can Editing One Neuron Fix LLM Repetition Loops?
- jun 15 industry Zhipu Ships GLM-5.2 With 1M Context and MIT Weights, but Zero Benchmarks at Launch
- jun 15 infra AWS Bedrock Now Requires Data Sharing for Mythos: The Self-Hosting Calculus
- jun 15 devtools Vercel's Remend Turns Streaming-Markdown Repair Into a Dependency
- jun 15 industry Moonshot's Kimi K2.7 Code Loses 11 of 12 Benchmark Cells, Leads on Efficiency Instead
- jun 14 policy Can Reinforcement Learning Be Provably Safe Without Sacrificing Scale?
- jun 14 infra vLLM Cold Start Latency: Why Scale-to-Zero LLM Serving Stalls
- jun 14 infra The Vercel-AWS Deal Reveals Where AI Inference Runs
- jun 14 agents Do Programming Languages Still Matter to Your AI Coding Agent?
- jun 14 agents Why Production AI Agents Fail Silently and Your Logs Never Catch It
- jun 13 security AMD Took 124 Days to Patch the RCE It First Called Out of Scope
- jun 12 policy US Export Order Forces Anthropic to Disable Fable 5 and Mythos 5 Worldwide
- jun 10 models Claude Fable 5 Benchmarks: What FrontierCode, CursorBench, and ViBench Show
- jun 11 agents Computer-Use Agents Fabricate Success on 8 to 33 Percent of Long-Horizon Tasks
- jun 10 infra Running RAG on a Snapdragon NPU: The On-Device Retrieval Tradeoff
- jun 10 models Does Attribution Patching Lie? A Fix for a Common Interpretability Shortcut
- jun 11 models Can You Make a Multimodal Model Unlearn With Activation Steering?
- jun 11 models Why Pruning a Model Can Raise Its Out-of-Distribution Accuracy
- jun 11 industry Vercel's Turborepo: Build Speed Becomes a Hosting-Vendor Feature
- jun 10 security OpenAI Frames Instruction Hierarchy as an Open Challenge, Not a Prompt-Injection Fix
- jun 10 devtools JetBrains Mellum2: A 12B Open-Weights Code Model for Self-Hosted Completion
- jun 09 models Do Unified Multimodal Models Actually Interleave Understanding and Generation?
- jun 09 agents Can AI Agents Share Context Without a Central Coordinator?
- jun 09 agents Why Skill Creation and Reward Optimization Collide in Agentic RL
- jun 09 infra GraphRAG vs VectorRAG: Does the Graph Index Earn Its Cost?
- jun 09 models How LLMs Track Who Did What: The Entity Rebinding Circuit
- jun 09 devtools Vercel's Chat SDK Targets Every Chat Platform From One Codebase
- jun 09 infra MiniMax M3 Ships 1M Context and Desktop Control as Open Weights
- jun 09 devtools NPM v12 Breaking Changes: Auditing Your Lockfiles Before the Upgrade
- jun 09 infra DeepSeek-V4 FlashMemory: Sparse Attention for Million-Token Context
- jun 09 agents When AI Agents Delegate Work, Your Observability Stack Goes Blind
- jun 09 models Claude Fable 5 vs Opus 4.8: When 2x Pricing Is Worth It
- jun 09 models Claude Mythos 5 Access Rules: Who Gets Project Glasswing and Why
- jun 09 policy Fable 5 Biology Classifiers: How Flagged Prompts Fall Back to Opus 4.8
- jun 09 industry Fable 5 Credit Cliff: What the June 23 Billing Shift Means for Teams
- jun 09 models Fable 5 Distillation Protection: How Anthropic Blocks Model Copying
- jun 09 models Skip Fable 5 or Upgrade? When Opus 4.8 and Sonnet 4.6 Are Still Enough
- jun 08 security Skill Injection: Hiding Undetectable Instructions in What an AI Agent Loads
- jun 08 models LLM Steganography: Can Defenders Detect Payloads Hidden in Model Output?
- jun 08 policy Who Gets to Audit Your Health Chatbot? Almost No One
- jun 08 policy Do Word-Subset Explanations Satisfy the EU AI Act's Transparency Rule?
- jun 08 infra Is Cloudflare's Bot Traffic Surge Real? The Measurement Dispute
- jun 08 industry OpenAI Pushes ChatGPT Into Compensation Data, Pressuring Mercer and Radford
- jun 08 policy Bit-Exact Inference Verification Gives AI Audits a Proof Mechanism
- jun 08 models Do Privacy Defenses Actually Protect Fine-Tuned LLMs? A New Benchmark
- jun 08 models Can You Reconstruct an LLM's System Prompt From Its Activations?
- jun 08 policy Can a Robot's Own Attention Flag Its Unsafe Actions Before They Run?
- jun 08 devtools Can a CLI Replace Screenshots for GUI Automation Agents?
- jun 08 agents Bloomberg's Pomona Makes Small Automated Code Changes, Not Big Agent PRs