Edit images based on source and target prompts
Generate images using selected LoRA prompts
Generate novel views from a single image
4M: Massively Multimodal Masked Modeling
Describe images using multiple models
Generate 3D room layouts from RGB panoramas
Create images from various types of annotations