Hackernews
new
show
ask
jobs
Writing an LLM from scratch, part 32g – Interventions: weight tying
1 points
posted 3 hours ago
by ibobev
(gilesthomas.com)
No comments yet