xinyiW915/ReLaX-VQA
Visual Question Answering, 5 datasets, deep-learning, vision, VQA, Transformer, CNN
arxiv: 2407.11496
License: apache-2.0
main: ReLaX-VQA/ugc_original_videos
1 contributor
History: 1 commit
Xinyi Wang: first commit (211b431), 8 days ago
5636101558_1080p.mp4    2.32 MB    LFS    first commit    8 days ago
5636101558_540p.mp4     1 MB       LFS    first commit    8 days ago
5636101558_720p.mp4     1.35 MB    LFS    first commit    8 days ago