lukax
7 hours ago
Is this Triton's reply to NVIDIA's tilus[1]. Tilus is suposed to be lower level (e.g. you have control over registers). NVIDIA really does not want the CUDA ecosystem to move to Triton as Triton also supports AMD and other accelerators. So with Gluon you get access to lower level features and you can stay within Triton ecosystem.
mdaniel
6 hours ago
Also it REALLY jams me up that this is a thing, complicating discussions: https://github.com/triton-inference-server/server
reasonableklout
6 hours ago
It sounds like they share that goal. Gluon is a thing because the Triton team realized over the last few months that Blackwell is a significant departure from the Hopper, and achieving >80% SoL kernels is becoming intractable as the triton middle-end simply can't keep up.
Some more info in this issue: https://github.com/triton-lang/triton/issues/7392
saagarjha
4 hours ago
I believe it’s the other way around; Gluon exposes the primitives Triton was built on top of.