Hackernews
new
show
ask
jobs
Llama.cpp: Deterministic Inference Mode (CUDA): RMSNorm, MatMul, Attention
4 points
posted 12 hours ago
by diwank
(github.com)
No comments yet