Article FastRTC: The Real-Time Communication Library for Python By freddyaboulton and 1 other • Feb 25 • 167
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published Feb 20 • 145
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 235
Article KV Caching Explained: Optimizing Transformer Inference Efficiency By not-lain • Jan 30 • 84
Contrastive Localized Language-Image Pre-Training Paper • 2410.02746 • Published Oct 3, 2024 • 38
multilingual vision models Collection Some papers I read for understanding vision models and also adding multilingual capabilities to them • 14 items • Updated Dec 11, 2024 • 2
Maya: An Instruction Finetuned Multilingual Multimodal Model Paper • 2412.07112 • Published Dec 10, 2024 • 29
M-RewardBench: Evaluating Reward Models in Multilingual Settings Paper • 2410.15522 • Published Oct 20, 2024 • 12
Transformer Explainer: Interactive Learning of Text-Generative Models Paper • 2408.04619 • Published Aug 8, 2024 • 162
Federated Learning driven Large Language Models for Swarm Intelligence: A Survey Paper • 2406.09831 • Published Jun 14, 2024 • 1
Article SmolLM - blazingly fast and remarkably powerful By loubnabnl and 2 others • Jul 16, 2024 • 379
Cohere Labs Aya 23 Collection Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 3 items • Updated Apr 15 • 55