Why text-based evals fail for vision-language models

1 pointsposted 2 days ago
by nikhilpareek13

Item id: 46508362

No comments yet