New Mistral AI Weights

55 pointsposted a year ago
by tymscar

16 Comments

danielhanchen

a year ago

Downloaded params.json - GeLU & 2D RoPE are used for the vision adapter. The vocab size also got larger - 131072 in size.

Also Mistral's latest tokenizer PR shows 3 extra new tokens (the image, the start & end).

The torrent is 24GB, and I guess the implementation will be up in HF in a few days!

Exciting times!

generalizations

a year ago

At 24gb, I wonder how small the quantized models will go.

WhatsName

a year ago

The name ist pixtral_12b_240910, size is 24gb. My best guess would be it's a multi modal LLM.

fsndz

a year ago

[flagged]

19h

a year ago

You should explicitly mention that this is your blog post, and that this is a "members-only" post?

user

a year ago

[deleted]