Show HN: From Photos to Positions: Prototyping VLM-Based Indoor Maps

55 pointsposted 7 months ago
by accurrent

2 Comments

rohanrao123

7 months ago

Pretty cool! It reminded me of this work from NVIDIA Research - https://nvidia-ai-iot.github.io/remembr where they used VLMs and RAG on top of a real robot to navigate the Voyager campus in Santa Clara. You also might like the new OpenAI o3 models and how well they can play GeoGuessr ;)

https://simonwillison.net/2025/Apr/26/o3-photo-locations, https://news.ycombinator.com/item?id=43835044, https://www.astralcodexten.com/p/testing-ais-geoguessr-geniu...

accurrent

7 months ago

Yep I've seen the NVidia research stuff. It's pretty cool.