Tucan Collection A series of open-source Bulgarian language models fine-tuned for function calling and tool use. 2.6B, 9B, and 27B parameter variants. β’ 12 items β’ Updated 2 days ago β’ 1
view article Article Train 400x faster Static Embedding Models with Sentence Transformers By tomaarsen β’ Jan 15 β’ 195
CoLLM: A Large Language Model for Composed Image Retrieval Paper β’ 2503.19910 β’ Published Mar 25 β’ 15
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 By tomaarsen β’ May 28, 2024 β’ 230
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 By tomaarsen β’ Mar 26 β’ 143
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper β’ 2503.11576 β’ Published Mar 14 β’ 109
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google By ariG23498 and 2 others β’ Feb 19 β’ 70
view article Article SigLIP 2: A better multilingual vision language encoder By ariG23498 and 2 others β’ Feb 21 β’ 171
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper β’ 2402.03300 β’ Published Feb 5, 2024 β’ 123
Executable Code Actions Elicit Better LLM Agents Paper β’ 2402.01030 β’ Published Feb 1, 2024 β’ 154
view article Article Open-source DeepResearch β Freeing our search agents By m-ric and 4 others β’ Feb 4 β’ 1.26k
llama.vim Collection Recommended models for the llama.vim and llama.vscode plugins β’ 9 items β’ Updated May 14 β’ 41
view article Article Finally, a Replacement for BERT: Introducing ModernBERT By bclavie and 14 others β’ Dec 19, 2024 β’ 659
view article Article Welcome to Inference Providers on the Hub π₯ By julien-c and 6 others β’ Jan 28 β’ 483
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths β’ 3 items β’ Updated Apr 28 β’ 119
view article Article SmolVLM Grows Smaller β Introducing the 250M & 500M Models! By andito and 2 others β’ Jan 23 β’ 181
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper β’ 2501.12948 β’ Published Jan 22 β’ 405
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Paper β’ 2501.11425 β’ Published Jan 20 β’ 106