Hackernews
new
show
ask
jobs
Rethinking Language Model Scaling Under Transferable Hypersphere Optimization
1 points
posted 10 hours ago
by matt_d
(arxiv.org)
No comments yet