microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition โข Updated about 7 hours ago โข 7.35k โข 520
ArTST - Arabic Text Speech Transformer Collection Open source project for Arabic Speech Recognition and Generation โข 11 items โข Updated 3 days ago โข 8
google/siglip2-large-patch16-256 Zero-Shot Image Classification โข Updated 8 days ago โข 5.79k โข 1
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google 10 days ago โข 60