Show HN: Deterministic PCIe Diagnostics for GPUs on Linux

15 pointsposted 10 hours ago
by gpu_systems

4 Comments

AuthAuth

8 hours ago

This is great. Are there any features you are looking to add? Would checking for bad memory blocks be useful? I've never seen it happen on a GPU but surely it must.

wtallis

9 hours ago

Is this entirely NVIDIA-specific, or can it do any diagnostics for other GPUs?

kimixa

6 hours ago

It's very much nvidia specific, not just using CUDA but the backing nvidia-specific management libraries.

Though I don't think there's anything particularly device-specific they're measuring, they're using the private nvidia interfaces to do so.

cr125rider

3 hours ago

OP you should call that out a little more clearly that this is Nvidia only.