Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
kesenZhaoNTU
/
UV-CoT
like
4
Image-Text-to-Text
Transformers
Safetensors
llava_llama
text-generation
multimodal
chain-of-thought
arxiv:
2504.18397
License:
cc-by-nc-nd-4.0
Model card
Files
Files and versions
xet
Community
1
Train
Deploy
Use this model
5ee53d9
UV-CoT
/
images
Ctrl+K
Ctrl+K
2 contributors
History:
1 commit
kesenZhaoNTU
Upload 3 files
47bf470
verified
about 1 month ago
fig1.svg
Safe
7.42 MB
Upload 3 files
about 1 month ago
fig5_v1.2.svg
Safe
3.68 MB
Upload 3 files
about 1 month ago
fig6_v1.2.svg
Safe
2.96 MB
Upload 3 files
about 1 month ago