hackernews client

Hackernews new show ask jobs

Accelerating LLM Inference on AMD GPUs with Low-Latency GEMMs

2 pointsposted 8 hours ago

by matt_d

(rocm.blogs.amd.com)

No comments yet