Hackernews
new
show
ask
jobs
Compiler optimizations for 5.8ms GPT-OSS-120B inference (not on GPUs)
9 points
posted 4 months ago
by olibaw
(furiosa.ai)
No comments yet