Submitted by nastasia-y · 57 · GHOST 2.0: generative high-fidelity one shot transfer of heads · 5 authors 2
Submitted by akhaliq · 34 · TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding · 6 authors 2
Submitted by jiminHuang · 29 · Plutus: Benchmarking Large Language Models in Low-Resource Greek Finance · 10 authors 2
Submitted by AggarwalTushar · 23 · Language Models' Factuality Depends on the Language of Inquiry · 6 authors 2
Submitted by CheeryLJH · 19 · Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning? · 10 authors 2
Submitted by Wesleythu · 17 · Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems · 7 authors 2
Submitted by shash42 · 16 · Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation · 6 authors 2
Submitted by orionweller · 14 · Rank1: Test-Time Compute for Reranking in Information Retrieval · 6 authors 2
Submitted by AmeyaPrabhu · 14 · Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs · 12 authors 2
Submitted by vyokky · 8 · VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model · 10 authors 2
Submitted by DrChiZhang · 7 · Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator · 6 authors 4
Submitted by nonstopfor · 5 · AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement · 16 authors 2
Submitted by Asap7772 · 4 · FSPO: Few-Shot Preference Optimization of Synthetic Preference Data in LLMs Elicits Effective Personalization to Real Users · 8 authors 2
Submitted by Taishi-N324 · 4 · Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization · 6 authors 2
Submitted by ThePyProgrammer · 3 · Adapting Automatic Speech Recognition for Accented Air Traffic Control Communications · 9 authors 1
Submitted by kailinjiang · 3 · MMKE-Bench: A Multimodal Editing Benchmark for Diverse Visual Knowledge · 7 authors 1
Submitted by AzureLeon1 · 3 · MolSpectra: Pre-training 3D Molecular Representation with Multi-modal Energy Spectra · 7 authors 2
Submitted by rohitsaxena · 2 · PosterSum: A Multimodal Benchmark for Scientific Poster Summarization · 3 authors 2
Submitted by SteveZeyuZhang · 1 · DOEI: Dual Optimization of Embedding Information for Attention-Enhanced Class Activation Maps · 9 authors 2