zhangfan's picture

13 4

zhangfan

Fan-s

·

https://github.com/zhangfan-p

zhangfan-p

AI & ML interests

Video Generation, MultiModal Learning

Recent Activity

upvoted a paper 19 days ago

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

upvoted a paper about 1 month ago

Architecture Decoupling Is Not All You Need For Unified Multimodal Model

upvoted a paper 5 months ago

LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation

View all activity

Organizations

upvoted a paper 19 days ago

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Paper • 2512.13604 • Published 19 days ago • 72

upvoted a paper about 1 month ago

Architecture Decoupling Is Not All You Need For Unified Multimodal Model

Paper • 2511.22663 • Published Nov 27, 2025 • 29

upvoted 2 papers 5 months ago

LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation

Paper • 2508.03694 • Published Aug 5, 2025 • 51

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29, 2025 • 136

upvoted 2 papers 6 months ago

Shape-for-Motion: Precise and Consistent Video Editing with 3D Proxy

Paper • 2506.22432 • Published Jun 27, 2025 • 13

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Paper • 2506.21356 • Published Jun 26, 2025 • 22

upvoted a paper 7 months ago

Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation

Paper • 2506.04225 • Published Jun 4, 2025 • 28

upvoted a paper 9 months ago

VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness

Paper • 2503.21755 • Published Mar 27, 2025 • 33

upvoted a paper 11 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3, 2025 • 61

upvoted 4 papers about 1 year ago

Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models

Paper • 2412.09645 • Published Dec 10, 2024 • 36

Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion

Paper • 2412.09593 • Published Dec 12, 2024 • 18

Material Anything: Generating Materials for Any 3D Object via Diffusion

Paper • 2411.15138 • Published Nov 22, 2024 • 50

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Paper • 2411.13503 • Published Nov 20, 2024 • 34