Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2506.09113

about 9 hours ago

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Paper • 2402.04252 • Published Feb 6, 2024 • 29
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models

Paper • 2402.03749 • Published Feb 6, 2024 • 13
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Paper • 2402.04615 • Published Feb 7, 2024 • 44
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Paper • 2402.05008 • Published Feb 7, 2024 • 23

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published 7 days ago • 80

WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens

Paper • 2401.09985 • Published Jan 18, 2024 • 18
CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects

Paper • 2401.09962 • Published Jan 18, 2024 • 9
Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution

Paper • 2401.10404 • Published Jan 18, 2024 • 11
ActAnywhere: Subject-Aware Video Background Generation

Paper • 2401.10822 • Published Jan 19, 2024 • 13

about 19 hours ago

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published 7 days ago • 80

Video Generation

about 4 hours ago

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published 7 days ago • 80
Video World Models with Long-term Spatial Memory

Paper • 2506.05284 • Published 12 days ago • 50

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 404
Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 139
StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation

Paper • 2409.12576 • Published Sep 19, 2024 • 16
Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 162

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published 7 days ago • 80

Video Understanding

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6 • 95
Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27 • 79
Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published 7 days ago • 80

Video Generation

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published 7 days ago • 80
Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion

Paper • 2506.08009 • Published 8 days ago • 18
Seeing Voices: Generating A-Roll Video from Audio with Mirage

Paper • 2506.08279 • Published 8 days ago • 23
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement

Paper • 2506.07848 • Published 8 days ago • 4

FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing

Paper • 2506.05046 • Published 12 days ago • 2
Image Editing As Programs with Diffusion Models

Paper • 2506.04158 • Published 13 days ago • 24
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement

Paper • 2506.07848 • Published 8 days ago • 4
MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice

Paper • 2503.05978 • Published Mar 7 • 35

Previous
1
2
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs