Hackernews
new
show
ask
jobs
Efficient Pre-Training with Token Superposition
2 points
posted 8 hours ago
by pyinstallwoes
(nousresearch.com)
No comments yet