Content
@
0 reply
0 recast
0 reaction
ππͺπΎπ‘π‘πΎ
@gm8xx8
Mistral released Pixtral 12B, a Vision Language Model with a 12B text backbone and 400M vision adapter. It supports larger vocabularies, new image tokens, processes 1024x1024 images, and uses bf16 weights. Mistral back at it again π₯ π: https://x.com/mistralai/status/1833758285167722836?s=46 π€: https://huggingface.co/mistral-community/pixtral-12b-240910
0 reply
0 recast
6 reactions