Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper • 2505.02567 • Published 26 days ago • 73
PixelHacker: Image Inpainting with Structural and Semantic Consistency Paper • 2504.20438 • Published Apr 29 • 42
stable-diffusion-v1-5/stable-diffusion-inpainting Text-to-Image • Updated Sep 6, 2024 • 2.4M • 59
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning Paper • 2502.13144 • Published Feb 18 • 40
diffusers/stable-diffusion-xl-1.0-inpainting-0.1 Text-to-Image • Updated Sep 3, 2023 • 725k • 336
stabilityai/stable-diffusion-xl-refiner-1.0 Image-to-Image • Updated Sep 25, 2023 • 730k • 1.9k
stabilityai/stable-diffusion-xl-base-1.0 Text-to-Image • Updated Oct 30, 2023 • 3.13M • 6.63k