Hugging Face
xinyiW915/ReLaX-VQA
Visual Question Answering
5 datasets
deep-learning
vision
VQA
Transformer
CNN
arxiv: 2407.11496
License: apache-2.0
main
1 contributor
History: 15 commits
Latest commit: update README (045b2f8), Xinyi Wang, 7 days ago
Name                    Size            Last commit      Updated
metadata                -               first commit     8 days ago
model                   -               Upload model     8 days ago
src                     -               first commit     8 days ago
ugc_original_videos     -               first commit     8 days ago
.gitattributes          1.6 kB          first commit     8 days ago
.gitignore              78 Bytes        Update           8 days ago
Framework.png           18.9 MB (LFS)   first commit     8 days ago
README.md               9.52 kB         update README    7 days ago
reported_result.ipynb   66.8 kB         first commit     8 days ago
requirements.txt        2.57 kB         first commit     8 days ago