JanitorBench: A new LLM benchmark for multi-turn chats

26 points, posted 3 months ago
by shep101

29 Comments

tomhow

3 months ago

We've killed all the green accounts that voted and commented on this, and moved the comments to a stub to hide them. We've also killed the post.

The HN guidelines and FAQ make it clear that it's not OK to ask people to upvote or comment on your stuff:

https://news.ycombinator.com/newsguidelines.html

https://news.ycombinator.com/newsfaq.html

And the HN community is very quick to notice it, flag the post and comments, and email us, all of which happened here.

There's no need to try and game HN like this. We're always looking for interesting new projects to showcase via Show HN, and we routinely spend substantial amounts of time helping people polish their Show HN posts. If we think the community may find a post interesting, we'll put it in the second chance pool (https://news.ycombinator.com/pool, explained here: https://news.ycombinator.com/item?id=26998308), which guarantees it a bit of front page time, without the bad vibes and negative sentiment you attract from trying to game HN.

shep101

3 months ago

hey, really sorry about that, i really shouldn't have told people to upvote on discord. would it be possible to make a new post? i will not announce it anywhere else

sparcpile

3 months ago

You should take the L on this one. Even if you got a "Show HN" post about JAI, I don't think the tech bros are going to be keen on an NSFW AI chatbot website. You still don't have a subscription offering after 2+ years. The users still wonder who in the hell pays to keep the lights on.

tomhow

3 months ago

[stub for green-account comments]

user

3 months ago

[deleted]

Sahadia

3 months ago

[dead]

hugorsmith

3 months ago

you can see jllm benchmarks on the table as well ('janitor-llm')

user

3 months ago

[deleted]