Show HN: HelixDB – A graph database built on object storage

45 pointsposted 4 hours ago
by GeorgeCurtis

23 Comments

caust1c

7 minutes ago

Where's the source code for the database itself? Looks like the repo is just a client.

Congrats on the launch!

GeorgeCurtis

3 minutes ago

This was a TEMPORARY decision we made, and I wrote a bit about why we did this here: https://x.com/georgecurtiss/status/2060043184059912470

We’re 100% committed to going back to open-source on an Apache 2.0 license as soon as possible. In the meantime, you can continue to deploy us completely for free, however you like, using the compiled docker container.

cjlm

an hour ago

Currently #5 on gdb-engines.com - definitely worth a look.

GeorgeCurtis

an hour ago

yooo this is awesome. Didn't even realise :)

mentioum

4 hours ago

We've been having some issues with intermittent performance on multi hop queries.

What's your p99 like for multi hops?

zw17

3 hours ago

If your use case is OLAP based, please check it out PuppyGraph. It’s a graph query engine that sits on top of your Lakehouse (no ETL required). Our benchmark has shown consistently that 10-hop queries across billions of edges in <2 seconds. Our customers including some most data demanding companies like Coinbase, Datadog, Palo Alto Network, Netskope, AMD, etc.

mentioum

3 hours ago

It's not, its actually our prod db with direct user usage - we self host a large dgraph cluster. We have a very large number of people manage their car and car histories with us and host a full replica of the UK MOT Database.

We're fine with clickhouse and redshift for the OLAP work we do. I've been looking at ParaQuery lately if I really want to speed that up.

GeorgeCurtis

3 hours ago

This sounds like a perfect usecase. Would love to learn more and see if we can help!

email us: founders@helix-db.com

GeorgeCurtis

3 hours ago

PuppyGraph is a good fit for OLAP for sure.

We’re just two young founders sharing what we’ve been building, so I’ll take the drive-by competitor plug as a compliment :)

Definitely a different focus though. Helix is OLTP, built for operational graph + vector workloads, especially apps/agent memory where low-latency traversals and writes are concerned.

GeorgeCurtis

4 hours ago

In prod we see p99’s of <10ms ms for warm queries and around 50ms per hop for cold queries.

mentioum

3 hours ago

Hmmm... I'll get in touch. Got an email i can reach out to, there doesn't seem to be one listed on your website?

I'm more concerned about if the p99s stay consistent when things get spikey.

dgraph is fine otherwise...

GeorgeCurtis

an hour ago

Sure! You can email me personally at george@helix-db.com

rajit

an hour ago

when will the graph memory layer be available?

maxrumpf

4 hours ago

does it support fts/vector on edges of the graph?

GeorgeCurtis

4 hours ago

Yes you can put vectors, full text data, secondary and range indexes on both nodes and edges.

brene

4 hours ago

How does this compare vs. Turbopuffer?

GeorgeCurtis

4 hours ago

We see comparable results for vectors and FTS.

For vector search we have warm and cold p99s of approx 20ms and 400ms respectively. For FTS, warm and cold query p99s of approx 15ms and 250ms respectively.

Both of these benchmarks were run on 1m docs.

raufakdemir

3 hours ago

what language does this support? cypher/gremlin?

GeorgeCurtis

3 hours ago

We don't support cypher or gremlin. We can

You can query HelixDB using JSON or directly in your programming language of choice by using our Rust, TypeScript, Go or Python SDKs. We’ve found AI is very good at working with the SDKs and JSON itself to query, making the development experience much better than before: https://docs.helix-db.com/database/querying

Bnjoroge

2 hours ago

congrats! how does this compare to turbopuffer, surreal or other multi-model ones built on object storage or not

GeorgeCurtis

an hour ago

tpuffer is a vector/fts database. Surreal is a bit of an "everything database".

We're a graph database with vector and FTS capabilities. Our vector and FTS benchmarks are comparable with tpuffer, but you would primarily use us for building whole applications, knowledge graphs, or AI memory/retrieval. Anything that is relationship intense.

Let me know if this properly answers your question