PatFig: Generating Short and Long Captions for Patent Figures Paper โข 2309.08379 โข Published Sep 15, 2023
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper โข 2506.01844 โข Published Jun 2 โข 128