Token-Count-Based Batching: Faster, Cheaper Embedding Inference for Queries

1 pointsposted 2 days ago
by fzliu

No comments yet