KaLM-Embedding-V2: Superior Training Techniques and Data Inspire A Versatile Embedding Model Paper • 2506.20923 • Published 10 days ago • 2
HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v2 Feature Extraction • 0.5B • Updated 8 days ago • 184 • 22
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Paper • 2505.04921 • Published May 8 • 178
VLM R1 Referral Expression 💬 Mark regions in images based on text descriptions Space • Runtime error • 70
VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model Paper • 2504.07615 • Published Apr 10 • 32
omlab/VLM-R1-Qwen2.5VL-3B-OVD-0321 Zero-Shot Object Detection • 4B • Updated Apr 14 • 2.35k • 14
omlab/Qwen2.5VL-3B-VLM-R1-REC-500steps Zero-Shot Object Detection • 4B • Updated Apr 14 • 2.58k • 22