DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning Paper • 2504.14509 • Published 4 days ago • 24
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment Paper • 2504.15585 • Published 2 days ago • 5
I-Con: A Unifying Framework for Representation Learning Paper • 2504.16929 • Published about 15 hours ago • 7
Decoupled Global-Local Alignment for Improving Compositional Understanding Paper • 2504.16801 • Published about 18 hours ago • 11
DreamO: A Unified Framework for Image Customization Paper • 2504.16915 • Published about 16 hours ago • 7
VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models Paper • 2504.15279 • Published 3 days ago • 45
Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model Paper • 2504.15843 • Published 2 days ago • 11
Personalized Text-to-Image Generation with Auto-Regressive Models Paper • 2504.13162 • Published 7 days ago • 14
LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities Paper • 2504.16078 • Published 1 day ago • 12
The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks Paper • 2504.15521 • Published 2 days ago • 53
RealisDance-DiT: Simple yet Strong Baseline towards Controllable Character Animation in the Wild Paper • 2504.14977 • Published 3 days ago • 6
MR. Video: "MapReduce" is the Principle for Long Video Understanding Paper • 2504.16082 • Published 1 day ago • 4
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale Paper • 2504.16030 • Published 1 day ago • 19
WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents Paper • 2504.15785 • Published 2 days ago • 8
From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning Paper • 2504.16080 • Published 1 day ago • 8
Vidi: Large Multimodal Models for Video Understanding and Editing Paper • 2504.15681 • Published 2 days ago • 12