Submitted by PhoenixZ 52 OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference · 13 authors 1
Submitted by jt-zhang 38 SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference · 7 authors 1
Submitted by akhaliq 28 SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution · 9 authors 3
Submitted by xilluill 26 KV-Edit: Training-Free Image Editing for Precise Background Preservation · 4 authors 2
Submitted by GlyphByT5 21 ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation · 17 authors 2
Submitted by Lucky2022 14 Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective · 5 authors 1
Submitted by Taoer 12 Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models · 6 authors 1
Submitted by AmberLJC 8 Curie: Toward Rigorous and Automated Scientific Experimentation with AI Agents · 10 authors 1
Submitted by Paper99 8 K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs · 3 authors 1
Submitted by rp-yu 5 Introducing Visual Perception Token into Multimodal Large Language Model · 3 authors 1
Submitted by Dominic789654 4 The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve? · 7 authors 1
Submitted by oceanpty 3 Finding the Sweet Spot: Preference Data Construction for Scaling Preference Optimization · 7 authors 1
Submitted by twigs 2 LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models · 2 authors 1
Submitted by SyedAbdul 2 Shakti-VLMs: Scalable Vision-Language Models for Enterprise AI · 3 authors 1
Submitted by jrzhang 1 MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs · 4 authors 1
Submitted by ahmedselhady - WiCkeD: A Simple Method to Make Multiple Choice Benchmarks More Challenging · 3 authors 1