TPI-LLM: Serving 70B-Scale LLMs Efficiently on Low-Resource Edge Devices

2 pointsposted 7 hours ago
by CrypticShift

No comments yet