Hackernews
new
show
ask
jobs
Writing an LLM from scratch, part 32d – Interventions: adding attention bias
4 points
posted 13 hours ago
by gpjt
(gilesthomas.com)
No comments yet