Article Efficient LLM Pretraining: Packed Sequences and Masked Attention By sirluk • Oct 7, 2024 • 39
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14 • 108
Article DeepSearch Using Visual RAG in Agentic Frameworks By paultltc and 1 other • Mar 21 • 32
Article ViDoRe Benchmark V2: Raising the Bar for Visual Retrieval By manu and 2 others • Mar 18 • 10
Article SmolVLM Grows Smaller – Introducing the 250M & 500M Models! By andito and 2 others • Jan 23 • 178
Article SmolVLM - small yet mighty Vision Language Model By andito and 4 others • Nov 26, 2024 • 291
Article Introducing smolagents: simple agents that write actions in code. By m-ric and 2 others • Dec 31, 2024 • 1.04k
RegMix: Data Mixture as Regression for Language Model Pre-training Paper • 2407.01492 • Published Jul 1, 2024 • 39
Parallel Sentences Datasets Collection These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual. • 14 items • Updated Feb 25 • 16
Article ColPali: Efficient Document Retrieval with Vision Language Models By manu • Jul 5, 2024 • 252
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 769