Visual Question-Visual Answering

community

AI & ML interests

None defined yet.

Recent Activity

ZichengD authored a paper about 1 month ago

Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss

ZichengD authored a paper about 1 month ago

VQ-VA World: Towards High-Quality Visual Question-Visual Answering

gouc published a Space 3 months ago

VQVA/README

ZichengD

authored 2 papers about 1 month ago

Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss

Paper • 2501.07563 • Published Jan 13, 2025 • 1

VQ-VA World: Towards High-Quality Visual Question-Visual Answering

Paper • 2511.20573 • Published Nov 25, 2025 • 7

gouc

published a Space 3 months ago

README

heheyas

authored a paper 7 months ago

From Virtual Games to Real-World Play

Paper • 2506.18901 • Published Jun 23, 2025 • 10

gouc

authored 2 papers 8 months ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11, 2025 • 154

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published May 20, 2025 • 133

heheyas

authored a paper over 1 year ago

MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization

Paper • 2408.02555 • Published Aug 5, 2024 • 31

zw123

authored 2 papers over 1 year ago

Rejuvenating image-GPT as Strong Visual Representation Learners

Paper • 2312.02147 • Published Dec 4, 2023 • 7

What If We Recaption Billions of Web Images with LLaMA-3?

Paper • 2406.08478 • Published Jun 12, 2024 • 41

heheyas

authored a paper over 1 year ago

Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels

Paper • 2405.16822 • Published May 27, 2024 • 12

heheyas

authored a paper almost 2 years ago

V3D: Video Diffusion Models are Effective 3D Generators

Paper • 2403.06738 • Published Mar 11, 2024 • 30

heheyas

authored a paper over 2 years ago

Text-to-3D using Gaussian Splatting

Paper • 2309.16585 • Published Sep 28, 2023 • 31

zw123

authored 2 papers over 2 years ago

CLIPA-v2: Scaling CLIP Training with 81.1% Zero-shot ImageNet Accuracy within a $10,000 Budget; An Extra $4,000 Unlocks 81.8% Accuracy

Paper • 2306.15658 • Published Jun 27, 2023 • 12

An Inverse Scaling Law for CLIP Training

Paper • 2305.07017 • Published May 11, 2023 • 3

Team members 5
