Hackernews
new
show
ask
jobs
Accelerating LLM Inference on AMD GPUs with Low-Latency GEMMs
2 points
posted 8 hours ago
by matt_d
(rocm.blogs.amd.com)
No comments yet