view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand qgallouedec • Dec 4, 2025 • 69
Common Pile v0.1 Collection All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated Jun 6, 2025 • 40
Trinity Collection Collection of Arcee AI models in the Trinity family • 14 items • Updated Mar 25 • 30
Holo2 Collection Holo2 - Cost-Efficient Models for Cross-Platform Computer-Use Agents • 4 items • Updated Feb 2 • 27
view article Article mem-agent: Equipping LLM Agents with Memory Using RL driaforall • Oct 9, 2025 • 33
GTA1 Collection A collection of GUI grounding models trained with GRPO. • 5 items • Updated Oct 31, 2025 • 5
view article Article Smol2Operator: Post-Training GUI Agents for Computer Use +3 A-Mahla, merve, sergiopaniego, reach-vb, lewtun • Sep 23, 2025 • 138
Holo1.5 Collection Holo1.5 - Open Foundation Models for Computer Use Agents • 5 items • Updated Sep 15, 2025 • 35
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning Paper • 2509.02544 • Published Sep 2, 2025 • 127
The Well Collection A 15TB collection of physics simulation datasets. • 18 items • Updated Mar 24, 2025 • 51
GUI Datasets Collection Datasets from the graphical user interfaces domain (screenshots). • 20 items • Updated Dec 3, 2024 • 8
V-JEPA 2 Collection A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 216
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial open-r1 • Jan 31, 2025 • 51