Submitted by EilamSha 103 TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations · 3 authors 4
Submitted by Wanfq 76 QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning · 10 authors 3
Submitted by Nardien 71 Distilling LLM Agent into Small Models with Retrieval and Code Tools · 5 authors 5
Submitted by BlackSamorez 70 Quartet: Native FP4 Training Can Be Optimal for Large Language Models · 8 authors 2
Submitted by yjyjyj98 59 Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models · 5 authors 2
Submitted by Ryan1122 55 One RL to See Them All: Visual Triple Unified Reinforcement Learning · 10 authors 2
Submitted by shenwzh3 39 QwenLong-CPRS: Towards ∞-LLMs with Dynamic Context Optimization · 15 authors 3
Submitted by RyanLiu112 38 Scaling Image and Video Generation via Test-Time Evolutionary Search · 7 authors 2
Submitted by ZonglinY 30 MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback · 10 authors 3
Submitted by kwanyoung 29 Model Already Knows the Best Noise: Bayesian Active Noise Selection via Attention in Video Diffusion Model · 2 authors 3
Submitted by Gigglingface 18 Diffusion Classifiers Understand Compositionality, but Conditions Apply · 4 authors 3
Submitted by JusperLee 17 AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models · 32 authors 2
Submitted by LoYoT 16 Direct3D-S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention · 11 authors 2
Submitted by dalime 16 Position of Uncertainty: A Cross-Linguistic Study of Positional Bias in Large Language Models · 8 authors 2
Submitted by pat-jj 16 s3: You Don't Need That Much Data to Train a Search Agent via RL · 7 authors 2
Submitted by SP2001 14 Teaching with Lies: Curriculum DPO on Synthetic Negatives for Hallucination Detection · 4 authors 2
Submitted by Kuvvi 14 FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow · 5 authors 2
Submitted by Jinyang23 14 Thought-Augmented Policy Optimization: Bridging External Guidance and Internal Capabilities · 8 authors 2
Submitted by Yunqiu 11 Clear Nights Ahead: Towards Multi-Weather Nighttime Image Restoration · 5 authors 2
Submitted by alandao 10 Speechless: Speech Instruction Training Without Speech for Low Resource Languages · 9 authors 2
Submitted by MenghaoGuo 10 RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs · 15 authors 3
Submitted by ssz1111 10 Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning · 14 authors 5
Submitted by yanxi-chen 9 Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models · 13 authors 2
Submitted by ed1son 9 ScanBot: Towards Intelligent Surface Scanning in Embodied Robotic Systems · 6 authors 2
Submitted by oneonlee 8 Are Vision-Language Models Safe in the Wild? A Meme-Based Benchmark Study · 4 authors 2
Submitted by mrwu 7 RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning · 17 authors 2
Submitted by yisuanwang 6 DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation · 12 authors 2
Submitted by Lingaaaaaaa 6 Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning · 7 authors 2
Submitted by yifAI 5 On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning · 6 authors 2
Submitted by beanie00 5 ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Reflection · 7 authors 2
Submitted by prateekv 5 Large Language Models Implicitly Learn to See and Hear Just By Reading · 2 authors 3
Submitted by ljcleo 3 Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models · 6 authors 2
Submitted by HwanChang0106 3 Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering · 4 authors 2
Submitted by BootsofLagrangian 3 Revisiting Residual Connections: Orthogonal Updates for Stable and Efficient Deep Networks · 5 authors 2
Submitted by Chaeeun-Kim 2 FREESON: Retriever-Free Retrieval-Augmented Reasoning via Corpus-Traversing MCTS · 2 authors 2
Submitted by rmahesh 2 Augmenting LLM Reasoning with Dynamic Notes Writing for Complex QA · 8 authors 2
Submitted by thinkwee 2 NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning · 6 authors 5
Submitted by songff 2 TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarios · 8 authors 2
Submitted by 3ebdola 1 NileChat: Towards Linguistically Diverse and Culturally Aware LLMs for Local Communities · 5 authors 2
Submitted by liboaccn 1 FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation · 4 authors 2
Submitted by Wyattz23 - Universal Biological Sequence Reranking for Improved De Novo Peptide Sequencing · 9 authors 2