CogVLM2: Visual Language Models for Image and Video Understanding Paper • 2408.16500 • Published Aug 29 • 56 • 5
RARR: Researching and Revising What Language Models Say, Using Language Models Paper • 2210.08726 • Published Oct 17, 2022 • 1 • 2
BlenderAlchemy: Editing 3D Graphics with Vision-Language Models Paper • 2404.17672 • Published Apr 26 • 18 • 2