Submitted by wenyi 139 GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning · 77 authors 361 3
Submitted by yilunzhao 33 SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks · 18 authors 26 2
Submitted by yuexiang96 32 Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning · 9 authors 11 2
Submitted by Haon-Chen 30 MoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings · 7 authors 32 1
Submitted by Lmxyy 28 Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation · 14 authors 237 3
Submitted by Sansa 17 DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation · 7 authors 132 1
Submitted by fushh7 10 HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context · 10 authors 12 1
Submitted by RanjanSapkota 9 Thinking Beyond Tokens: From Brain-Inspired Intelligence to Cognitive Foundations for Artificial General Intelligence and its Societal Impact · 20 authors 2
Submitted by Amar-S 8 Training for X-Ray Vision: Amodal Segmentation, Amodal Content Completion, and View-Invariant Object Representation from Multi-Camera Video · 5 authors 1
Submitted by puar-playground 5 MusiXQA: Advancing Visual Music Understanding in Multimodal Large Language Models · 9 authors 1
Submitted by Simase 4 FreeLong++: Training-Free Long Video Generation via Multi-band SpectralFusion · 2 authors 1
Submitted by AdinaY 4 IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering · 10 authors 26 1
Submitted by amanchadha 4 Peccavi: Visual Paraphrase Attack Safe and Distortion Free Image Watermarking Technique for AI-Generated Images · 7 authors 1
Submitted by huxueyu 3 Mixture of Reasonings: Teach Large Language Models to Reason with Adaptive Strategies · 4 authors 1
Submitted by Peter2023HuggingFace 1 FreNBRDF: A Frequency-Rectified Neural Material Representation · 3 authors 4 1
Submitted by AmirHossein-razlighi 1 Confident Splatting: Confidence-Based Compression of 3D Gaussian Splatting via Learnable Beta Distributions · 3 authors 1