Hackernews
new
show
ask
jobs
Compiler optimizations for 5.8ms GPT-OSS-120B inference (not on GPUs)
1 points
posted 20 hours ago
by olibaw
(furiosa.ai)
No comments yet