Balance Act: Mitigating Hubness in Cross-Modal Retrieval with Query and Gallery Banks Paper • 2310.11612 • Published Oct 17, 2023
InvGC: Robust Cross-Modal Retrieval by Inverse Graph Convolution Paper • 2310.13276 • Published Oct 20, 2023
UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction Paper • 2503.15661 • Published Mar 19, 2025
GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks Paper • 2504.12764 • Published Apr 17, 2025
Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers Paper • 2505.21497 • Published May 27, 2025