SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14 • 110
Article Gemma 3n fully available in the open-source ecosystem! By ariG23498 and 7 others • 13 days ago • 105
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 3.46M • 2.48k
Trendyol/TY-ecomm-embed-multilingual-base-v1.2.0 Sentence Similarity • 0.3B • Updated May 13 • 449 • 30