Topic

#llama

2 articles exploring llama. Expert insights and analysis from our editorial team.

Showing 1–2 of 2 articles

Articles

Newest first
Models & Research

DuQuant++ Brings Fine-Grained Rotation to FP4: What Microscaling Quantization Means for Running Larger Models on the Same GPU

DuQuant++ adapts outlier-aware rotation to MXFP4, halving online rotation cost on LLaMA 3 and shifting the FP4 deployment bottleneck from memory to calibration engineering.

Models & Research

Qwen 2.5 vs Llama 3.3: The Open-Weight Showdown Nobody Is Talking About

Alibaba's Qwen 2.5 beats Meta's Llama 3.3 on math, multilingual tasks, and structured data — yet gets a fraction of the Western press coverage.

· 8 min read