Multi-Images Multi-Audio Multi-turn Multi-Modal bilingual TinyLlama

SigLIP encoder + Whisper encoder + TinyLlama. Source code: https://github.com/mesolitica/multimodal-LLM
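A common way to pair separate vision and audio encoders with a language model is to project each encoder's output features into the language model's embedding space and concatenate them with the text token embeddings. The sketch below illustrates that general idea in plain Python with toy dimensions. It is an assumption about the overall approach, not the repository's actual implementation; every function name, dimension, and the ordering of modalities here is hypothetical.

```python
import random


def random_matrix(rows, cols, rng):
    """Toy stand-in for learned weights or encoder outputs."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def project(features, weight):
    """Linear projection: (n, d_in) x (d_in, d_out) -> (n, d_out)."""
    columns = list(zip(*weight))  # each column has length d_in
    return [
        [sum(f * w for f, w in zip(row, col)) for col in columns]
        for row in features
    ]


def build_sequence(text_emb, image_feats, audio_feats, w_img, w_aud):
    """Map image and audio features into the LLM embedding space,
    then concatenate them with the text embeddings (hypothetical order)."""
    return project(image_feats, w_img) + project(audio_feats, w_aud) + text_emb


def demo():
    rng = random.Random(0)
    # Illustrative sizes only; the real encoder and model widths differ.
    img_dim, aud_dim, llm_dim = 6, 4, 8
    image_feats = random_matrix(3, img_dim, rng)  # 3 image patch features
    audio_feats = random_matrix(2, aud_dim, rng)  # 2 audio frame features
    text_emb = random_matrix(5, llm_dim, rng)     # 5 text token embeddings
    w_img = random_matrix(img_dim, llm_dim, rng)  # image projector weights
    w_aud = random_matrix(aud_dim, llm_dim, rng)  # audio projector weights
    return build_sequence(text_emb, image_feats, audio_feats, w_img, w_aud)
```

Calling `demo()` returns one combined sequence of embedding rows, all with the language model's width, which is the shape a decoder-only model would consume in this kind of setup.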

Model size: 1.62B params (Safetensors)
Tensor type: BF16