VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory Paper • 2506.18903 • Published 6 days ago • 18
Optimizing Multilingual Text-To-Speech with Accents & Emotions Paper • 2506.16310 • Published 11 days ago • 22
view article Article (LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware By derekl35 and 4 others • 11 days ago • 67
GenRecal: Generation after Recalibration from Large to Small Vision-Language Models Paper • 2506.15681 • Published 11 days ago • 36
Squeeze3D: Your 3D Generation Model is Secretly an Extreme Neural Compressor Paper • 2506.07932 • Published 21 days ago • 12
DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO Paper • 2506.07464 • Published 21 days ago • 10
Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models Paper • 2506.07177 • Published 22 days ago • 22
Seeing Voices: Generating A-Roll Video from Audio with Mirage Paper • 2506.08279 • Published 20 days ago • 26
Dreamland: Controllable World Creation with Simulator and Generative Models Paper • 2506.08006 • Published 20 days ago • 7
Splatting Physical Scenes: End-to-End Real-to-Sim from Imperfect Robot Data Paper • 2506.04120 • Published 26 days ago • 7
PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers Paper • 2506.05573 • Published 24 days ago • 68
Ψ-Sampler: Initial Particle Sampling for SMC-Based Inference-Time Reward Alignment in Score Models Paper • 2506.01320 • Published 28 days ago • 16