Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding Paper • 2203.00867 • Published Mar 2, 2022
Local Consensus Enhanced Siamese Network with Reciprocal Loss for Two-view Correspondence Learning Paper • 2308.03217 • Published Aug 6, 2023
MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo Paper • 2401.11673 • Published Jan 22, 2024
CLUE: A Chinese Language Understanding Evaluation Benchmark Paper • 2004.05986 • Published Apr 13, 2020
VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing Paper • 2407.04461 • Published Jul 5, 2024
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion Paper • 2407.11398 • Published Jul 16, 2024 • 10
MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing Paper • 2408.08000 • Published Aug 15, 2024 • 9
SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer Paper • 2404.03736 • Published Apr 4, 2024
AnyRefill: A Unified, Data-Efficient Framework for Left-Prompt-Guided Vision Tasks Paper • 2502.11158 • Published Feb 16, 2025
MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model Paper • 2411.16157 • Published Nov 25, 2024
LiON-LoRA: Rethinking LoRA Fusion to Unify Controllable Spatial and Temporal Generation for Video Diffusion Paper • 2507.05678 • Published Jul 8, 2025 • 1
EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion Paper • 2507.16535 • Published Jul 22, 2025 • 23
RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space Paper • 2508.08588 • Published Aug 12, 2025
WorldCompass: Reinforcement Learning for Long-Horizon World Models Paper • 2602.09022 • Published Feb 9 • 21
WorldStereo: Bridging Camera-Guided Video Generation and Scene Reconstruction via 3D Geometric Memories Paper • 2603.02049 • Published Mar 2 • 17
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds Paper • 2604.14268 • Published 14 days ago • 116
Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints Paper • 2303.02885 • Published Mar 6, 2023
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation Paper • 2504.14899 • Published Apr 21, 2025 • 20