arxiv:2501.05452
Xingyu Fu
Fiaa
AI & ML interests
NLP, multimodal
Recent Activity
liked
a dataset
5 days ago
deepcs233/Visual-CoT
liked
a model
11 days ago
stabilityai/stable-video-diffusion-img2vid-xt
authored
a paper
15 days ago
ReFocus: Visual Editing as a Chain of Thought for Structured Image
Understanding