metadata
language:
- en
license: openrail++
pipeline_tag: depth-estimation
library_name: diffusers
tags:
- depth estimation
- image analysis
- computer vision
- in-the-wild
- zero-shot
pinned: true
Marigold Disparity v0.1 Model Card
The model is fine-tuned from the stable-diffusion-2
model as
described in our papers, in inverse depth (disparity) space, with "trailing" and "zero-snr":
- CVPR'2024 paper titled "Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation"
- Journal extension titled "Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis"
Using the model
- This model is for internal test.
- Get to the bottom of things with our official codebase.
- Developed by: Bingxin Ke, Anton Obukhov, Shengyu Huang, Nando Metzger, Rodrigo Caye Daudt, Konrad Schindler.
- Model type: Generative latent diffusion-based affine-invariant disparity (inverse depth) estimation from a single image.
- Language: English.
- License: Apache License License Version 2.0.
- Cite as:
@misc{ke2025marigold,
title={Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis},
author={Bingxin Ke and Kevin Qu and Tianfu Wang and Nando Metzger and Shengyu Huang and Bo Li and Anton Obukhov and Konrad Schindler},
year={2025},
eprint={2505.09358},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
@InProceedings{ke2023repurposing,
title={Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation},
author={Bingxin Ke and Anton Obukhov and Shengyu Huang and Nando Metzger and Rodrigo Caye Daudt and Konrad Schindler},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2024}
}