New Mistral AI Weights

55 points posted 3 days ago
by tymscar

16 Comments

danielhanchen

3 days ago

Downloaded params.json: GeLU and 2D RoPE are used for the vision adapter. The vocab size also got larger, now 131072.

Also Mistral's latest tokenizer PR shows 3 extra new tokens (the image, the start & end).

The torrent is 24GB, and I guess the implementation will be up on HF in a few days!

Exciting times!
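For anyone who wants to check the same file themselves, here is a minimal sketch that flattens a downloaded params.json into dotted key/value pairs, so the activation, RoPE, and vocab settings are easy to scan. It assumes nothing about the actual field names in the file; the path is whatever you pass on the command line, and `flatten` and `load_params` are hypothetical helper names, not part of any Mistral tooling.

```python
import json
import sys


def flatten(obj, prefix=""):
    """Yield (dotted_key, value) pairs for every leaf of a nested JSON object.

    Lists and scalars are treated as leaves; only dicts are descended into.
    """
    if isinstance(obj, dict):
        for key, value in obj.items():
            yield from flatten(value, f"{prefix}{key}.")
    else:
        yield prefix.rstrip("."), obj


def load_params(path):
    """Read a JSON file and return its contents as a flat dict."""
    with open(path) as f:
        return dict(flatten(json.load(f)))


if __name__ == "__main__" and len(sys.argv) > 1:
    # Hypothetical usage: python inspect_params.py path/to/params.json
    for name, value in sorted(load_params(sys.argv[1]).items()):
        print(f"{name} = {value}")
```

Nested sections (for example a vision block, if the file has one) come out with a dotted prefix, which makes it quick to tell language-model settings apart from adapter settings.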

generalizations

2 days ago

At 24GB, I wonder how small the quantized models will go.
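A rough back-of-the-envelope sketch, assuming about 12 billion parameters (a guess from the "12b" in the file name mentioned in this thread, which also matches 24GB at 16 bits per weight). It only counts raw weight storage; real quantized files carry extra overhead for scales and any layers left unquantized, so treat these as lower bounds.

```python
def estimated_size_gb(n_params, bits_per_weight):
    """Raw weight storage in GB (1 GB = 1e9 bytes), ignoring format overhead."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumption: ~12e9 parameters, inferred from the file name, not confirmed.
N_PARAMS = 12e9

for bits in (16, 8, 4):
    print(f"{bits:>2}-bit: ~{estimated_size_gb(N_PARAMS, bits):.0f} GB")
```

Under those assumptions the weights alone would come to roughly 12 GB at 8-bit and 6 GB at 4-bit.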

WhatsName

3 days ago

The name is pixtral_12b_240910, and the size is 24GB. My best guess would be that it's a multimodal LLM.

fsndz

3 days ago

[flagged]

19h

2 days ago

You should explicitly mention that this is your own blog post, and that it is a "members-only" post.

user

2 days ago

[deleted]