Hackernews
new
show
ask
jobs
Llama.cpp: Deterministic Inference Mode (CUDA): RMSNorm, MatMul, Attention
6 points
posted 5 months ago
by diwank
(github.com)
No comments yet