Hackernews
new
show
ask
jobs
Long-Context Attention from Kernel Efficiency to Distributed Context Parallelism
1 points
posted 11 hours ago
by PaulHoule
(arxiv.org)
No comments yet