2 19 12

chenjie cao

ewrfcas

ewrfcas

AI & ML interests

computer vision

Recent Activity

authored a paper 8 days ago

Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding

authored a paper 8 days ago

Local Consensus Enhanced Siamese Network with Reciprocal Loss for Two-view Correspondence Learning

authored a paper 8 days ago

MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo

View all activity

Organizations

None yet

authored 18 papers 8 days ago

Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding

Paper • 2203.00867 • Published Mar 2, 2022

Local Consensus Enhanced Siamese Network with Reciprocal Loss for Two-view Correspondence Learning

Paper • 2308.03217 • Published Aug 6, 2023

VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing

Paper • 2407.04461 • Published Jul 5, 2024

Animate3D: Animating Any 3D Model with Multi-view Video Diffusion

Paper • 2407.11398 • Published Jul 16, 2024 • 10

MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing

Paper • 2408.08000 • Published Aug 15, 2024 • 9

SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer

Paper • 2404.03736 • Published Apr 4, 2024

AnyRefill: A Unified, Data-Efficient Framework for Left-Prompt-Guided Vision Tasks

Paper • 2502.11158 • Published Feb 16, 2025

MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model

Paper • 2411.16157 • Published Nov 25, 2024

LiON-LoRA: Rethinking LoRA Fusion to Unify Controllable Spatial and Temporal Generation for Video Diffusion

Paper • 2507.05678 • Published Jul 8, 2025 • 1

EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion

Paper • 2507.16535 • Published Jul 22, 2025 • 23

RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space

Paper • 2508.08588 • Published Aug 12, 2025

WorldCompass: Reinforcement Learning for Long-Horizon World Models

Paper • 2602.09022 • Published Feb 9 • 21

WorldStereo: Bridging Camera-Guided Video Generation and Scene Reconstruction via 3D Geometric Memories

Paper • 2603.02049 • Published Mar 2 • 17

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Paper • 2604.14268 • Published 14 days ago • 116

Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints

Paper • 2303.02885 • Published Mar 6, 2023

authored a paper about 1 year ago

Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation

Paper • 2504.14899 • Published Apr 21, 2025 • 20

authored a paper about 2 years ago

Repositioning the Subject within Image

Paper • 2401.16861 • Published Jan 30, 2024 • 14

chenjie cao

AI & ML interests

Recent Activity

Organizations

ewrfcas's activity