rbalicki
3 days ago
Very cool! This lets you grade output across different base models. Does it also allow you grade output across different prompts?
randall
3 days ago
that’s the next step… we have a structured approach to prompting too that we think will help people build better prompts too.