hackernews client

Show HN: Create LLM graders and run evals in JavaScript with one file

28 pointsposted 3 days ago

2 Comments

rbalicki

3 days ago

Very cool! This lets you grade output across different base models. Does it also allow you grade output across different prompts?

randall

3 days ago

that’s the next step… we have a structured approach to prompting too that we think will help people build better prompts too.