Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Paper • 2501.12375 • Published 7 days ago • 20
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published 7 days ago • 46
view article Article Introducing smolagents: simple agents that write actions in code. 29 days ago • 531
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 14 days ago • 126
view article Article Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging By akjindal53244 • Aug 19, 2024 • 76
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Paper • 2501.09751 • Published 12 days ago • 46
FAST: Efficient Action Tokenization for Vision-Language-Action Models Paper • 2501.09747 • Published 12 days ago • 23
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them Paper • 2501.08292 • Published 14 days ago • 17