Bingxin's picture
Update README.md
7d90285 verified
metadata
language:
  - en
license: openrail++
pipeline_tag: depth-estimation
library_name: diffusers
tags:
  - depth estimation
  - image analysis
  - computer vision
  - in-the-wild
  - zero-shot
pinned: true

Marigold Disparity v0.1 Model Card

Image Depth diffusers Github Website arXiv License

The model is fine-tuned from the stable-diffusion-2 model as described in our papers, in inverse depth (disparity) space, with "trailing" and "zero-snr":

  • CVPR'2024 paper titled "Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation"
  • Journal extension titled "Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis"

Using the model

@misc{ke2025marigold,
  title={Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis},
  author={Bingxin Ke and Kevin Qu and Tianfu Wang and Nando Metzger and Shengyu Huang and Bo Li and Anton Obukhov and Konrad Schindler},
  year={2025},
  eprint={2505.09358},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

@InProceedings{ke2023repurposing,
  title={Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation},
  author={Bingxin Ke and Anton Obukhov and Shengyu Huang and Nando Metzger and Rodrigo Caye Daudt and Konrad Schindler},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2024}
}