RobotArm-eff-single-action Collection Collection of VLMs model fine-tuned to predict a single action end-effector location/position to reach a target based on the prompts and camera feeds • 4 items • Updated May 9
Vision/multimodal Models Collection Collection of the most popular vision models including Llama 3.2, LlaVa, Qwen2 VL, Pixtral, PaliGemma and more! • 25 items • Updated 5 days ago • 12