Submitted by Seongyun 58 Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning · 4 authors 5
Submitted by lovodkin93 47 RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation · 11 authors 2
Submitted by Kaichengalex 32 Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs · 9 authors 2
Submitted by phillipinseoul 21 Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation · 6 authors 3
Submitted by XiaohuanZhou 15 QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining · 10 authors 2
Submitted by akhaliq 11 Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models · 25 authors 3
Submitted by Sta8is 6 Boosting Generative Image Modeling via Joint Image-Feature Synthesis · 5 authors 2
Submitted by akhaliq 4 3DV-TON: Textured 3D-Guided Consistent Video Try-on via Diffusion Models · 4 authors 2
Submitted by yaolily 4 TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos · 14 authors 2
Submitted by joanrodai 4 Distilling semantically aware orders for autoregressive image generation · 8 authors 2
Submitted by lwpyh 4 ViSMaP: Unsupervised Hour-long Video Summarisation by Meta-Prompting · 4 authors 2
Submitted by erikbergh 2 Interpretable non-linear dimensionality reduction using gaussian weighted linear transformation · 1 authors 2