𝚐𝔪𝟾𝚡𝚡𝟾
@gm8xx8
The Mamba-hybrid ecosystem is growing. Zyphra's Zamba-7B:
- a 7B Mamba/attention hybrid model
- matches Mistral-7B & Gemma-7B performance w/ only 1T open training tokens
- surpasses Llama-2 7B & OLMo-7B
- set to release all training checkpoints under Apache 2.0

This model looks very strong. https://www.zyphra.com/zamba
0 reply
0 recast
5 reactions

LoadingALIAS
@loadingalias
Hey, has anyone used a Mamba model? Any feedback?
0 reply
0 recast
0 reaction