Llama 3.2V 11B Cot
💬 Chat about images with text input
Xkev/Llama-3.2V-11B-cot
Image-Text-to-Text • 11B • Updated • 2.31k • 155
Xkev/LLaVA-CoT-100k
Viewer • Updated • 98.6k • 1.52k • 97
LLaVA-o1: Let Vision Language Models Reason Step-by-Step
Paper • 2411.10440 • Published • 130
Guowei Xu PRO
Xkev
AI & ML interests
None yet
Recent Activity
updated a Space 1 day ago: Xkev/Llama-3.2V-11B-cot
upvoted a paper 9 days ago: metaTextGrad: Automatically optimizing language model optimizers
liked a model 23 days ago: deepseek-ai/DeepSeek-V3.1-Base
Organizations
None yet