LLM rerankers for production RAG: tips and tricks

5 pointsposted 5 months ago
by mathcircler

1 Comments

alexpivnenko

5 months ago

Surprised that removing spaces actually had such a big effect on latency.

Also props for including the prompt and AB results