Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated 17 days ago • 72
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intelligence • 2 items • Updated 6 days ago • 94
Article The Complete Guide to AI Architectures: From Neural Networks to Foundation Models By ProCreations • 6 days ago • 1
Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • 9 days ago • 572
VBART Finetuned Models Collection VBART model finetuned to specific cases. • 10 items • Updated May 15, 2024 • 2
Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 10 days ago • 548
SmolLM3 pretraining datasets Collection datasets used in SmolLM3 pretraining • 14 items • Updated 10 days ago • 20
🧠SmolLM3 Collection Smol, multilingual, long-context reasoner • 10 items • Updated 7 days ago • 59
Tar Collection Unifying Visual Understanding and Generation via Text-Aligned Representations • 5 items • Updated 16 days ago • 14
Falcon-H1 Collection Falcon-H1 Family of Hybrid-Head Language Models (Transformer-SSM), including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained & instruction-tuned). • 37 items • Updated Jun 13 • 47
Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • May 21 • 188
Simple Collection A series of simple datasets and soon models by me! • 4 items • Updated 11 days ago • 3
Article Transformers Are Getting Old: Variants and Alternatives Exist! By ProCreations • 13 days ago • 40
ERNIE 4.5 Collection collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 25 items • Updated 7 days ago • 151
GLiNER-X Collection A multilingual Named Entity Recognition (NER) model capable of identifying any entity type. • 6 items • Updated 24 days ago • 19