Submitted by Nicolas-BZRD 49 Should We Still Pretrain Encoders with Masked Language Modeling? · 8 authors 59 5
Submitted by JunhaoZhuang 31 4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture · 7 authors 1
Submitted by RunpeiDong 28 DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge · 13 authors 29 1
Submitted by RowitZou 26 Pre-Trained Policy Discriminators are General Reward Models · 22 authors 40 1
Submitted by KYLN24 19 BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset · 15 authors 6 1
Submitted by hiyouga 12 Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents · 7 authors 9.2k 1
Submitted by Bibaolong 12 RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs · 10 authors 1
Submitted by ZZXF 8 Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restoration · 8 authors 24 1
Submitted by xxzcc 7 ArtifactsBench: Bridging the Visual-Interactive Gap in LLM Code Generation Evaluation · 32 authors 1
Submitted by justinyyy 7 OmniDraft: A Cross-vocabulary, Online Adaptive Drafter for On-device Speculative Decoding · 7 authors 1
Submitted by ziyjiang 4 VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents · 13 authors 292 1
Submitted by SteveZeyuZhang 4 PresentAgent: Multimodal Agent for Presentation Video Generation · 7 authors 11 1
Submitted by cedricbonhomme 4 VLAI: A RoBERTa-Based Model for Automated Vulnerability Severity Classification · 2 authors 11 1
Submitted by danielchyeh 3 Beyond Simple Edits: X-Planner for Complex Instruction-Based Image Editing · 7 authors 1
Submitted by jannalu 1 Evaluating LLMs on Real-World Forecasting Against Human Superforecasters · 1 author 2
Submitted by amanchadha 1 MOD-X: A Modular Open Decentralized eXchange Framework proposal for Heterogeneous Interoperable Artificial Agents · 5 authors 1
Submitted by ashutosh1919 1 Disambiguation-Centric Finetuning Makes Enterprise Tool-Calling LLMs More Realistic and Less Risky · 3 authors 1