Show HN: Gemma 3 inference in pure C++ with Metal acceleration

3 points, posted 10 hours ago
by ybubnov

2 Comments

k1r111

8 hours ago

Looks really cool, thank you. I can't find anything about performance. Is it faster? Or is it just a cool demo?

ybubnov

2 hours ago

That’s on my short list of things to do next. In recent releases, my primary focus was on keeping the executable compact and providing a modern C++ API.