Adrian Lepers
AdrianLepers
AI & ML interests
None yet
Recent Activity
liked
a model
about 6 hours ago
Qwen/Qwen-Image
liked
a Space
about 6 hours ago
Jimmyzheng-10/ScreenCoder
reacted
to
sergiopaniego's
post
with π₯
about 6 hours ago
Want to learn how to align a Vision Language Model (VLM) for reasoning using GRPO and TRL? π
π§βπ³ We've got you covered!!
NEW multimodal post training recipe to align a VLM using TRL in @HuggingFace's Cookbook.
Go to the recipe πhttps://huggingface.co/learn/cookbook/fine_tuning_vlm_grpo_trl
Powered by the latest TRL v0.20 release, this recipe shows how to teach Qwen2.5-VL-3B-Instruct to reason over images π