Hacker News
Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery [pdf]
2 points
posted 7 hours ago
by gmays
(research.nvidia.com)
No comments yet