Compiler optimizations for 5.8ms GPT-OSS-120B inference (not on GPUs)

1 pointsposted 20 hours ago
by olibaw

No comments yet