Category
Models & Research
Foundation models, releases, benchmarks, and AI research.
33 articles exploring Models & Research. Expert analysis and insights from our editorial team.
Showing 31–33 of 33 articles
· Page 3 of 3
Latest in Models & Research
Newest first
31 32 33
Two Different Tricks for Fast LLM Inference: Speeding Up AI Responses
Speculative decoding and efficient memory management through PagedAttention are two proven techniques that accelerate LLM inference by 2-24x without sacrificing output quality, enabling production deployments at scale.
Fine-Tune LLMs 2x Faster with 70% Less VRAM: The Unsloth Guide
Discover how Unsloth's Triton-optimized kernels enable 2x faster LLM fine-tuning with 70% less VRAM, making it possible to train DeepSeek, Qwen, and Llama models on consumer GPUs without sacrificing accuracy.
The Best AI Models for OpenClaw in 2026
A comprehensive guide to selecting the right LLM for your OpenClaw workflows, covering coding, writing, reasoning, and cost-effective options.