Hackernews
Microsoft ArchScale: Simple and Scalable Pretraining
3 points | posted 7 months ago by tosh | (github.com)
No comments yet