Post
2753
visual reasoning is now in transformers π₯
THUDM/GLM-4.1V-9B-Thinking is just released and merged into transformers, we gave it a vibe test run π€
it's very good, comes with 64k context length and MIT license π
it supports 4k image tokens and any aspect ratio as well!
Notebook: http://colab.research.google.com/drive/1atODIiV57hOZLv16Bjzwd6fwx0yoTorj?usp=sharing
Demo: THUDM/GLM-4.1V-9B-Thinking-Demo
THUDM/GLM-4.1V-9B-Thinking is just released and merged into transformers, we gave it a vibe test run π€
it's very good, comes with 64k context length and MIT license π
it supports 4k image tokens and any aspect ratio as well!
Notebook: http://colab.research.google.com/drive/1atODIiV57hOZLv16Bjzwd6fwx0yoTorj?usp=sharing
Demo: THUDM/GLM-4.1V-9B-Thinking-Demo