Bringing Up DeepSeek-V4-Flash on AMD MI300X

72 pointsposted 7 hours ago
by kkm

7 Comments

maCDzP

4 hours ago

I train on AMD MI250X and managed to get Gemma 4 31B to work - but it took a lot of work on the software side.

kkm

4 hours ago

This is very interesting, planning to write about it?

mezark

5 hours ago

We at doubleword are bullish for AMD for low-interactivity inference - it does just take a bigger lift on the software side...

latchkey

10 minutes ago

Nice work and thanks for being a customer.

(CEO Hot Aisle)

benlm

5 hours ago

Nice work! Would DeepSeek V4 Pro on 8xMI300X work with these patches?