Lin Huang's picture

493 1

Lin Huang

Lin17

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 13 hours ago

Tora2: Motion and Appearance Customized Diffusion Transformer for Multi-Entity Video Generation

upvoted a paper about 13 hours ago

Differential Mamba

upvoted a paper about 13 hours ago

Critiques of World Models

View all activity

Organizations

None yet

upvoted 12 papers about 13 hours ago

Tora2: Motion and Appearance Customized Diffusion Transformer for Multi-Entity Video Generation

Paper • 2507.05963 • Published 4 days ago • 9

Differential Mamba

Paper • 2507.06204 • Published 4 days ago • 16

Critiques of World Models

Paper • 2507.05169 • Published 5 days ago • 20

StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling

Paper • 2507.05240 • Published 5 days ago • 40

OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Paper • 2507.06165 • Published 4 days ago • 49

Perception-Aware Policy Optimization for Multimodal Reasoning

Paper • 2507.06448 • Published 4 days ago • 40

Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data

Paper • 2507.07095 • Published 3 days ago • 47

Skywork-R1V3 Technical Report

Paper • 2507.06167 • Published 4 days ago • 56

4KAgent: Agentic Any Image to 4K Super-Resolution

Paper • 2507.07105 • Published 3 days ago • 69

LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS

Paper • 2507.07136 • Published 4 days ago • 21

Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

Paper • 2507.07982 • Published 2 days ago • 26

Scaling RL to Long Videos

Paper • 2507.07966 • Published 2 days ago • 105

upvoted 8 papers 8 days ago

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Paper • 2506.18890 • Published 19 days ago • 6

Auto-Regressively Generating Multi-View Consistent Images

Paper • 2506.18527 • Published 19 days ago • 8

4Real-Video-V2: Fused View-Time Attention and Feedforward Reconstruction for 4D Scene Generation

Paper • 2506.18839 • Published 24 days ago • 10

3D Arena: An Open Platform for Generative 3D Evaluation

Paper • 2506.18787 • Published 19 days ago • 12

DIP: Unsupervised Dense In-Context Post-training of Visual Representations

Paper • 2506.18463 • Published 19 days ago • 21

ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs

Paper • 2506.18792 • Published 19 days ago • 29

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published 19 days ago • 72

Light of Normals: Unified Feature Representation for Universal Photometric Stereo

Paper • 2506.18882 • Published 19 days ago • 84