ViSMaP: Unsupervised Hour-long Video Summarisation by Meta-Prompting Paper • 2504.15921 • Published 5 days ago • 4
TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos Paper • 2504.17343 • Published 3 days ago • 4
3DV-TON: Textured 3D-Guided Consistent Video Try-on via Diffusion Models Paper • 2504.17414 • Published 3 days ago • 5
DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs Paper • 2504.17040 • Published 3 days ago • 8
Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models Paper • 2504.17789 • Published 2 days ago • 11
QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining Paper • 2504.16511 • Published 4 days ago • 15
Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation Paper • 2504.17207 • Published 3 days ago • 23
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs Paper • 2504.17432 • Published 3 days ago • 32
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning Paper • 2504.17192 • Published 3 days ago • 62
RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation Paper • 2504.17502 • Published 3 days ago • 49
Step1X-Edit: A Practical Framework for General Image Editing Paper • 2504.17761 • Published 2 days ago • 65
CRUST-Bench: A Comprehensive Benchmark for C-to-safe-Rust Transpilation Paper • 2504.15254 • Published 5 days ago • 4
Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA Paper • 2504.10419 • Published 12 days ago • 4
Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading Paper • 2504.11919 • Published 11 days ago • 10
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment Paper • 2504.15585 • Published 5 days ago • 10
Decoupled Global-Local Alignment for Improving Compositional Understanding Paper • 2504.16801 • Published 3 days ago • 14
I-Con: A Unifying Framework for Representation Learning Paper • 2504.16929 • Published 3 days ago • 26