Hackernews
new
show
ask
jobs
Microsoft ArchScale: Simple and Scalable Pretraining
3 points
posted 12 hours ago
by tosh
(github.com)
No comments yet