models & research
more in this beat
- may 24 models μP Hyperparameter Transfer Has an Embedding Layer Hole, New arXiv Paper Says
- may 23 models Project Glasswing One Month In: AI Bug Discovery Has Outpaced the Patch Pipeline
- may 22 models arXiv 2605.16428 Measures AI Search's Drag on Publisher Traffic Using Paired Google and Reddit Data
- may 22 models A Theory of Time-Sensitive Language Generation Says Sparse Hallucination Beats Mode Collapse
- may 18 models The Last Word Often Wins: A Format Confound Inflates Chain-of-Thought Corruption Robustness Scores
- may 17 models Learning, Fast and Slow: What arXiv 2605.12484 Proposes for LLMs That Adapt Continually
- apr 27 models There Will Be a Scientific Theory of Deep Learning: What arXiv 2604.21691 Argues and Where It Will Lose
- apr 22 models Qwen3.6-27B's Dense Architecture Challenges the MoE-Only Playbook for Flagship-Class Coding Models
- mar 23 models Chinese AI Models Compared: DeepSeek, Qwen, Kimi, Doubao, and Ernie
- mar 23 models Running DeepSeek R1 Locally: Hardware Requirements, Quantization, and Real Throughput
- mar 14 models Fish-Speech: The Open-Source TTS Model That's Threatening ElevenLabs
- feb 26 models Synthetic Data Is Eating AI Training
- feb 26 models Google's TimesFM: A Foundation Model for Time Series
- feb 26 models Gemini 2.0 Pro's 2 Million Token Context: What Can You Actually Do With It?
- feb 26 models DeepSeek V3/R1: How Chinese Engineers Matched GPT-4 for $6 Million
- feb 26 models Claude's Web Search Changes Everything for AI Research
- feb 26 models The Million-Token Context Window: What Can You Actually Do?
- feb 18 models Gemini 3.1 Pro: Google's New Reasoning Model Explained
- feb 17 models Kimi Claw: Moonshot AI's Answer to Claude and ChatGPT
- feb 17 models WiFi DensePose: Full-Body Tracking Through Walls Using Your Router
- feb 14 models AI Code Generation Benchmarks 2026: Which Model Actually Writes Better Code?
- feb 10 models The Best AI Models for OpenClaw in 2026