Topic

#training

2 articles exploring training. Expert insights and analysis from our editorial team.

Articles

AI Models

DeepSeek V3/R1: How Chinese Engineers Matched GPT-4 for $6 Million

DeepSeek's V3 and R1 models reach GPT-4-class performance with a fraction of the compute, relying on architectural innovations in Mixture of Experts, attention compression, and reinforcement learning. The result suggests that training efficiency may matter more than raw hardware scale.

10 min read
Model Training

Fine-Tune LLMs 2x Faster with 70% Less VRAM: The Unsloth Guide

Discover how Unsloth's Triton-optimized kernels enable 2x faster LLM fine-tuning with 70% less VRAM, making it possible to train DeepSeek, Qwen, and Llama models on consumer GPUs without sacrificing accuracy.

8 min read