SATORI-R1: Incentivizing Multimodal Reasoning with Spatial Grounding and Verifiable Rewards Paper • 2505.19094 • Published 5 days ago • 2 • 2
Reverse Preference Optimization for Complex Instruction Following Paper • 2505.22172 • Published 1 day ago • 4 • 2
VideoGameBench: Can Vision-Language Models complete popular video games? Paper • 2505.18134 • Published 6 days ago • 6 • 3
DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction Paper • 2505.21473 • Published 2 days ago • 12 • 2
Precise In-Parameter Concept Erasure in Large Language Models Paper • 2505.22586 • Published 1 day ago • 1 • 2
Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods Paper • 2505.17870 • Published 7 days ago • 4 • 2
Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO Paper • 2505.22453 • Published 1 day ago • 37 • 2
MangaVQA and MangaLMM: A Benchmark and Specialized Model for Multimodal Manga Understanding Paper • 2505.20298 • Published 3 days ago • 3 • 2
Prot2Token: A Unified Framework for Protein Modeling via Next-Token Prediction Paper • 2505.20589 • Published 3 days ago • 6 • 2
DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research Paper • 2505.19253 • Published 4 days ago • 22 • 2
Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities Paper • 2505.21191 • Published 3 days ago • 2 • 1
Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs Paper • 2505.19075 • Published 5 days ago • 17 • 2
Efficient Data Selection at Scale via Influence Distillation Paper • 2505.19051 • Published 5 days ago • 3 • 2
Styl3R: Instant 3D Stylized Reconstruction for Arbitrary Scenes and Styles Paper • 2505.21060 • Published 3 days ago • 3 • 2
Benchmarking Recommendation, Classification, and Tracing Based on Hugging Face Knowledge Graph Paper • 2505.17507 • Published 7 days ago • 3 • 2
LIMOPro: Reasoning Refinement for Efficient and Effective Test-time Scaling Paper • 2505.19187 • Published 4 days ago • 12 • 2
Fostering Video Reasoning via Next-Event Prediction Paper • 2505.22457 • Published 1 day ago • 27 • 2
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models Paper • 2505.22617 • Published 1 day ago • 86 • 3
HoPE: Hybrid of Position Embedding for Length Generalization in Vision-Language Models Paper • 2505.20444 • Published 3 days ago • 2 • 2