Yan Shu's picture

Yan Shu

sy1998

·

AI & ML interests

None yet

Recent Activity

updated a dataset about 1 month ago

sy1998/tempsports

upvoted a paper about 2 months ago

Outline-Guided Object Inpainting with Diffusion Models

upvoted a paper about 2 months ago

When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding

View all activity

Organizations

commented a paper about 2 months ago

When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding

Paper • 2506.05551 • Published Jun 5 • 5 •

commented a paper 2 months ago

EarthMind: Towards Multi-Granular and Multi-Sensor Earth Observation with Large Multimodal Models

Paper • 2506.01667 • Published Jun 2 • 21 •

New activity in sy1998/VidText 2 months ago

Upload 475 files

#9 opened 2 months ago by

Upload 77 files

#7 opened 2 months ago by

Upload 77 files

#8 opened 2 months ago by

Upload 556 files

#3 opened 2 months ago by

Upload 546 files

#4 opened 2 months ago by

Upload 707 files

#5 opened 2 months ago by

Upload 204 files

#2 opened 2 months ago by

Upload 517 files

#1 opened 2 months ago by

commented a paper 2 months ago

VidText: Towards Comprehensive Evaluation for Video Text Understanding

Paper • 2505.22810 • Published May 28 • 20 •