Groundy — independent coverage of developer tools, infrastructure, and platforms
models Can You Stitch Two Foundation Models Together Without Retraining?
Splicing layers from independently trained foundation models fails without targeted training at the join point. A two-stage recipe called Final Feature Matching makes it work.
infra Cloudflare Acquires VoidZero, the Company Behind Vite's Rust Toolchain
Cloudflare acquired VoidZero, putting Vite, Rolldown, and Oxc maintainers on a deploy-target vendor's payroll. MIT licensing stays. Roadmap neutrality is the open question.
security Jailbreak Suffixes Hit Harder at Specific Token Positions, New GCG Variant Shows
SlotGCG shows adversarial token position, not just content, determines jailbreak success, with 14% higher attack rates and 42% higher rates against defended models.
policy When Should an LLM Forget You? A Benchmark for Deciding What Memory to Drop
PersistBench finds LLMs mishandle persistent memory 53 to 97 percent of the time. Unlearning suppresses rather than erases user data, making GDPR compliance unverifiable.
security OpenAI Adds Lockdown Mode to ChatGPT, Shifting Prompt-Injection Risk to Users
OpenAI's Lockdown Mode disables agentic features builders rely on rather than fixing prompt injection at runtime, forcing a binary choice between security and capability.
policy When RL Training Rewards Capability-Seeking: A New Alignment Risk
A June 2026 ICML paper shows RL optimizers can push language models to exploit reward loopholes the task never required, while standard performance metrics hold steady.
models Do Reasoning LLMs Waste Tokens? OckBench Tries to Measure It
OckBench scores 37 reasoning LLMs on token efficiency alongside accuracy, finding comparably accurate models differ by up to 26× in token cost under per-token billing.
security Activation Steering Was Sold as LLM Control. New Work Makes It an Attack Surface
Poisoning 4-6% of tokens in a steering dataset silently inverts refusal vectors into jailbreaks, achieving 20-55% ASR. Shared vector bundles are the attack surface.
- models Opus 4.8 vs Opus 4.7: What Changed and What Did Not
- agents Claude Code, Cursor, Copilot: How Agentic Coding Assistants Get Weaponized as Attacker Shells
- devtools Anthropic Buys Stainless: OpenAI and Google Now Depend on a Rival for SDK Tooling
- agents A New Trust Schema Exposes Why Agent Skill Registries Fail Enterprise Audit Requirements
- policy FTC's TAKE IT DOWN Act Lands May 19: 48-Hour Deepfake NCII Takedowns and No Safe Harbor
- devtools GitHub Copilot vs Cursor vs Claude Code: The 2026 AI Coding Showdown
- devtools Claude Code Plugins: Anthropic's Official Plugin Ecosystem Explained
- infra MLX vs llama.cpp on Apple Silicon: Which Runtime to Use for Local LLM Inference
- models AI Code Generation Benchmarks 2026: Which Model Actually Writes Better Code?
- devtools GitHub Copilot's Opus 4.7 Multiplier: 7.5x to 15x to 27x in 60 Days
- models Chinese AI Models Compared: DeepSeek, Qwen, Kimi, Doubao, and Ernie
- devtools Claude Code in GitHub Actions: A Complete Guide to Automated PR Fixes
- devtools GitHub Copilot Replaces Premium Request Units With Token-Metered AI Credits on June 1
- culture EU's 2027 Replaceable Battery Mandate: What It Means for Phone Buyers and Repairers Right Now
- industry Cursor's Meteoric Rise: From $300M to $3B ARR in a Year
- jun 04 models Can You Stitch Two Foundation Models Together Without Retraining?
- jun 04 infra Cloudflare Acquires VoidZero, the Company Behind Vite's Rust Toolchain
- jun 04 security Jailbreak Suffixes Hit Harder at Specific Token Positions, New GCG Variant Shows
- jun 04 policy When Should an LLM Forget You? A Benchmark for Deciding What Memory to Drop
- jun 04 security OpenAI Adds Lockdown Mode to ChatGPT, Shifting Prompt-Injection Risk to Users
- jun 04 policy When RL Training Rewards Capability-Seeking: A New Alignment Risk
- jun 04 models Do Reasoning LLMs Waste Tokens? OckBench Tries to Measure It
- jun 04 security Activation Steering Was Sold as LLM Control. New Work Makes It an Attack Surface
- jun 04 culture Can Teaching Logical Fallacies Inoculate People Against AI Misinformation?
- jun 04 devtools Vercel Ships Experimental Native CLI Binaries to Cut the Node Startup Tax
- jun 04 security Catching LLM Agents Leaking Credentials From Their Own Activations
- jun 04 policy Refusal Steering Targets Individual Experts in MoE LLMs
- jun 04 infra Putting a Datacenter V100 in a Gaming PC: The Local LLM Math
- jun 04 devtools Vercel Rebuilds Its Marketplace CLI for Agents Instead of Humans
- jun 04 security The 2026 npm Attacks Proved AI Coding Assistants Are a Supply-Chain Target
- jun 03 security ChatGPT's New Lockdown Mode Borrows Apple's Name for a Prompt-Injection Kill Switch
- jun 03 agents When MCP Tool Descriptions Don't Match the Code, Agents Trust the Lie
- jun 03 security Students Are Prompt-Injecting AI Graders to Score Full Marks
- jun 03 devtools Malicious npm Packages Hit Red Hat's Published JavaScript Clients
- jun 03 policy Stacked Org Policies in LLM Chatbots Break Where Rules Collide
- jun 03 security Removing an LLM Backdoor Post-Training Without the Poisoned Data
- jun 03 models Which Layer Detects LLM Hallucinations Best? The Case Against Fixed-Layer Probes
- jun 03 policy Why Fine-Tuning Strips Safety Alignment From Open-Weight LLMs
- jun 03 security Stored Prompt Injection Now Persists Across AI Agent Sessions
- jun 03 industry MiniMax M3 Bundles 1M Context and Native Multimodal Into One Open-Weight Model
- jun 03 security LLM Data Poisoning Survives the Data-Cleaning Defenses Built to Stop It
- jun 03 devtools OpenAI Upgrades Codex Right as Teams Weigh Leaving Claude Code
- jun 03 policy Game Theory vs RLHF: Modeling LLM Safety Alignment as a Non-Cooperative Game
- jun 03 infra Cost-Aware RAG Routing: When Deeper Retrieval Stops Paying Off
- jun 02 devtools GitHub Copilot Moves to a Platform App, Decoupling From the Editor
- jun 02 infra Using Your Nvidia GPU's VRAM as Linux Swap: Where the NBD Hack Breaks Down
- jun 02 security Why OpenAI Bets on Instruction Hierarchy to Stop Prompt Injection
- jun 02 policy Explainability Mandates Leak Graph Models to Their Attackers
- jun 02 security Stopping Multi-Turn LLM Jailbreaks Without Retraining the Model
- jun 02 security African Languages Are a Jailbreak Blind Spot for English-Tuned LLM Safety
- jun 02 devtools How a VSCode Bug Let One Click Steal Your GitHub Token
- jun 02 agents When an AI Agent Causes a Loss, Who Files the Insurance Claim?
- jun 02 models Cross-Domain RL Training Degrades Capabilities. CARE-RL Reweights to Fix It
- jun 02 agents When Agent Skill Libraries Scale, Dependency-Aware Retrieval Beats Flat Search
- jun 02 policy Evolutionary Search Finds LLM Jailbreak Classes That Static Red-Teaming Misses
- jun 02 security Poisoning Open-Source LLM Merges: One Bad Checkpoint Hijacks the Result
- jun 02 agents Can Instruction-Tuned Retrievers Fix Agentic Search's Retrieval Gap?
- jun 02 models LLM Watermarking Without Quality Loss: The Non-Distortionary Approach
- jun 02 security An Autonomous Research Agent Now Discovers SOTA LLM Jailbreak Attacks
- jun 02 devtools GitHub Copilot and Productivity: What an Observational Dose-Response Study Measures
- jun 02 policy Why AI Red-Teaming Rediscovers the Same Jailbreaks and Misses the Rest
- jun 02 industry Morningstar's $780B SpaceX Mark Undercuts the IPO Target by Half
- jun 02 security Malware Can Prompt-Inject the AI Agent Reverse-Engineering It
- jun 02 agents Bandit-Based Prompt Optimization Targets Multi-Agent Systems Like CrewAI and AutoGen
- jun 02 security CVE-Factory Turns Published CVEs Into Security Agent Training Data. A 32B Model Beats Claude 4.5 Sonnet.