LLM rerankers for production RAG: tips and tricks

4 pointsposted 7 hours ago
by mathcircler

1 Comments

alexpivnenko

6 hours ago

Surprised that removing spaces actually had such a big effect on latency.

Also props for including the prompt and AB results