Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation • arXiv:2502.20388 • Published Feb 2025
FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute • arXiv:2502.20126 • Published Feb 2025
How far can we go with ImageNet for Text-to-Image generation? • arXiv:2502.21318 • Published Feb 2025
VideoUFO: A Million-Scale User-Focused Dataset for Text-to-Video Generation • arXiv:2503.01739 • Published Mar 2025
Magic 1-For-1: Generating One Minute Video Clips within One Minute • arXiv:2502.07701 • Published Feb 2025
Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance • arXiv:2502.06145 • Published Feb 2025
Light-A-Video: Training-free Video Relighting via Progressive Light Fusion • arXiv:2502.08590 • Published Feb 2025
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models • arXiv:2502.01061 • Published Feb 3, 2025
SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer • arXiv:2501.18427 • Published Jan 30, 2025
Fast Encoder-Based 3D from Casual Videos via Point Track Processing • arXiv:2404.07097 • Published Apr 10, 2024
Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers • arXiv:2501.03931 • Published Jan 7, 2025
Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control • arXiv:2501.03847 • Published Jan 7, 2025
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos • arXiv:2501.04001 • Published Jan 7, 2025
VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control • arXiv:2412.20800 • Published Dec 30, 2024
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control • arXiv:2501.01427 • Published Jan 2, 2025