jacksonastone
11 hours ago
Made this in a free evening. Index an permissive license translation of the Bible (WEB) into a RAG database to allow returning passages of similar semantic meaning. Lots of fun. For example, "more money more problems" returns Ecclesiastes 5:9-13 which, I'll just say, is spot on..
"Moreover the profit of the earth is for all. The king profits from the field. He who loves silver shall not be satisfied with silver, nor he who loves abundance, with increase. This also is vanity. When goods increase, those who eat them are increased; and what advantage is there to its owner, except to feast on them with his eyes? The sleep of a laboring man is sweet, whether he eats little or much; but the abundance of the rich will not allow him to sleep. There is a grievous evil which I have seen under the sun: wealth kept by its owner to his harm."
Anyway - thought it was fun enough to share. It's slow and I vibe coded it so I haven't sorted out how to make it not take 15 seconds to vector search against the full 4GB index.
mcswell
11 hours ago
Slow, but interesting. I used the query "government" and got back passages in Romans 13 (as I expected), but also passages in Daniel and Ezra describing decrees by government officials, which made sense.
kordlessagain
7 hours ago
This is really cool...great job! It's a favorite pastime of mine to index various large corpora.
As for speed, this might help for code referencing: https://github.com/deepbluedynamics/lume
Blog post: https://deepbluedynamics.com/blog/lume-retrieval-primitives
I use a small local model to extract entities for the graph, but it's not necessary.
You can optionally use GTR-T5 which is a few years old now, but still good for generating fast and free embeddings. That step is only run once if you run it in hybrid mode.
Feel free to take and remix or use!
atmanactive
2 hours ago
For completeness, this should include all possible books, including Ethiopian, and then it should include a drop-down with pre-defined sets one could choose from (Protestant, Catholic, Orthodox...).
ReactiveJelly
10 hours ago
> The king profits from the field
For the solution, read Henry George!
yjftsjthsd-h
7 hours ago
Any chance you wrote it up anywhere? I'm somewhat interested in RAG and a worked example would be nice.
jacksonastone
2 hours ago
I'll just open source it, just a sec (keep in mind, vibe coded :D)
jnamaya
11 hours ago
The site is not loading for me! Interesting project
mcswell
11 hours ago
You have to wait awhile. It came up for me after 30 seconds or so.
jacksonastone
9 hours ago
it's basically running on a toaster, sorry
TeMPOraL
8 hours ago
If that toaster didn't have a plan already, then feeding it with monotheistic holy scripture will definitely give it some ideas.
jacksonastone
2 hours ago
I wonder if polytheistic toaster would be more or less... ambitious.
a96
5 hours ago
A toaster is just a death ray with a smaller power supply
cdong
8 hours ago
pg-vector with hnsw is still 15 seconds?
anamnesis
11 hours ago
[dead]