Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing Paper • 2504.21356 • Published Apr 30
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs Paper • 2504.17432 • Published Apr 24 • 39
EliGen: Entity-Level Controlled Image Generation with Regional Attention Paper • 2501.01097 • Published Jan 2