Hackernews
Token-Count-Based Batching: Faster, Cheaper Embedding Inference for Queries
1 point
posted 2 days ago
by fzliu
(mongodb.com)