vunderba
5 hours ago
Nice job - but it's definitely not abandonware having been re-released on Steam [1] (along with a sequel) back in 2015.
Regarding the verifier that plays against the live engine, I’ve approached the problem from a similar angle by having LLM agents effectively borrow a page from the speedrunning community in the form of tool-assisted speedruns, allowing the LLM access only to a virtualized game controller.
[1] - https://store.steampowered.com/app/346850/Chips_Challenge_1
kenblum
5 hours ago
Good catch on the Steam re-release. I'll add a note, or pull the site if it comes to that.
Curious about your agent setup though. Any public repo?
vunderba
5 hours ago
Even if you do pull the game itself I would still definitely leave all the post mortem stuff up. I think it's just as interesting and worth keeping around - especially the YT vids demonstrating the harness.
I don't have a GH repo up for the TAS system yet - it's a bespoke mess right now since it was built with the old game "Castle of the Winds" in mind but I'll definitely consider it in the future!