Token-Count-Based Batching: Faster, Cheaper Embedding Inference for Queries

1 pointsposted a day ago
by fzliu

No comments yet