Hackernews
new
show
ask
jobs
TPI-LLM: Serving 70B-Scale LLMs Efficiently on Low-Resource Edge Devices
2 points
posted 7 hours ago
by CrypticShift
(arxiv.org)
No comments yet