Step1X-Edit: A Practical Framework for General Image Editing • Paper • 2504.17761 • Published 2 days ago • 64
OmniSVG: A Unified Scalable Vector Graphics Generation Model • Paper • 2504.06263 • Published 18 days ago • 151
CogVLM2: Visual Language Models for Image and Video Understanding • Paper • 2408.16500 • Published Aug 29, 2024 • 58
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining • Paper • 2408.02657 • Published Aug 5, 2024 • 36
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts • Paper • 2406.09162 • Published Jun 13, 2024 • 14