Cosmos-Reason1 Collection Multimodal world understanding through reasoning • 5 items • Updated 3 days ago • 22
Article NVIDIA Cosmos Now Available On Hugging Face For Physical AI Reasoning By PranjaliJoshi and 1 other • 5 days ago • 22
Describe Anything Collection Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 4 days ago • 50
Describe Anything: Detailed Localized Image and Video Captioning Paper • 2504.16072 • Published Apr 22 • 61
Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control Paper • 2503.14492 • Published Mar 18 • 18
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Paper • 2503.15558 • Published Mar 18 • 47
Article Announcing NVIDIA Cosmos World Foundation Models By mingyuliutw and 1 other • Jan 7 • 25
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models Paper • 2411.07126 • Published Nov 11, 2024 • 31
Cosmos Tokenizer Collection A suite of image and video tokenizers • 13 items • Updated 4 days ago • 40