Hackernews
new
show
ask
jobs
SVDQuant: 4-Bit Quantization Powers 12B Flux on a 16GB 4090 GPU with 3x Speedup
178 points
posted 6 days ago
by lmxyy
(hanlab.mit.edu)
No comments yet