VLM R1 Referral Expression 💬 Mark regions in images based on text descriptions