Topic

#rotation-based-quantization

1 article exploring rotation-based-quantization. Expert insights and analysis from our editorial team.

Showing 1–1 of 1 articles

Articles

Newest first
Models & Research

DuQuant++ Makes FP4 Quantization Practical for LLM Inference: What Fine-Grained Rotation Means for Blackwell Deployments

DuQuant++ aligns rotation block size with MXFP4 microscaling groups, halving preprocessing cost and pushing W4A4 accuracy close to FP8 as Blackwell FP4 Tensor Cores ship.