srush
9 months ago
I made these a couple of years ago as a teaching exercise for https://minitorch.github.io/. At the time the resources for doing anything on GPUs were pretty sparse and the NVidia docs were quite challenging.
These days there are great resources for going deep on this topic. The CUDA-mode org is particularly great, both their video series and PMPP reading groups.
nextos
9 months ago
Slightly offtopic, but any chance you could update or re-upload code for your https://github.com/harvardnlp/DeepLatentNLP tutorial? I found the NLP latent variable models discussed there really interesting, and notebooks were excellent. However, these seem gone and the only thing left are slides?
Alternatively, any other places that discuss the same topics, including some code? I could only find equivalent discussions with code in Pyro docs and Kevin Murphy's book, volume 2. But these are more sparse as they also cover many other topics.
bytepoet
9 months ago
Thanks a lot, Sasha, for creating these. I found your LLM training puzzles to be excellent as well.
srush
9 months ago
Awesome! Here are all of them if anyone else is looking.
https://github.com/srush/Triton-puzzles https://github.com/srush/tensor-puzzles https://github.com/srush/autodiff-puzzles https://github.com/srush/transformer-puzzles https://github.com/srush/GPTworld https://github.com/srush/LLM-Training-Puzzles
lins1909
9 months ago
Thanks Sasha - this looks like a great resource.Just to be clear, would you recommend going through other newer resources than this instead?
Not sure if your comment is to discourage someone from going through this.
olive247
9 months ago
Do you have links to the other great resources you are referring to?