marigold-disparity-affine-v0-1 / README.md

Update README.md

7d90285 verified 3 months ago

4.25 kB

	---
	language:
	- en
	license: openrail++
	pipeline_tag: depth-estimation
	library_name: diffusers
	tags:
	- depth estimation
	- image analysis
	- computer vision
	- in-the-wild
	- zero-shot
	pinned: true
	---

	<h1 align="center">Marigold Disparity v0.1 Model Card</h1>


	<p align="center">
	<a title="Image Depth" href="https://huggingface.co/spaces/prs-eth/marigold" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
	<img src="https://img.shields.io/badge/%F0%9F%A4%97%20Image%20Depth%20-Demo-yellow" alt="Image Depth">
	</a>
	<a title="diffusers" href="https://huggingface.co/docs/diffusers/using-diffusers/marigold_usage" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
	<img src="https://img.shields.io/badge/%F0%9F%A4%97%20diffusers%20-Integration%20🧨-yellow" alt="diffusers">
	</a>
	<a title="Github" href="https://github.com/prs-eth/marigold" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
	<img src="https://img.shields.io/github/stars/prs-eth/marigold?label=GitHub%20%E2%98%85&logo=github&color=C8C" alt="Github">
	</a>
	<a title="Website" href="https://marigoldcomputervision.github.io/" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
	<img src="https://img.shields.io/badge/%E2%99%A5%20Project%20-Website-blue" alt="Website">
	</a>
	<a title="arXiv" href="https://arxiv.org/abs/2505.09358" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
	<img src="https://img.shields.io/badge/%F0%9F%93%84%20Read%20-Paper-AF3436" alt="arXiv">
	</a>
	<!-- <a title="Social" href="https://twitter.com/antonobukhov1" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
	<img src="https://img.shields.io/twitter/follow/:?label=Subscribe%20for%20updates!" alt="Social">
	</a> -->
	<a title="License" href="https://huggingface.co/stabilityai/stable-diffusion-2/blob/main/LICENSE-MODEL" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
	<img src="https://img.shields.io/badge/License-OpenRAIL++-929292" alt="License">
	</a>
	</p>

	<!-- This is a model card for the `marigold-disparity-affine-v0-1` model for monocular depth estimation from a single image. -->

	The model is fine-tuned from the `stable-diffusion-2` [model](https://huggingface.co/stabilityai/stable-diffusion-2) as
	described in our papers, in inverse depth (disparity) space, with "trailing" and "zero-snr":
	<!-- (train_marigold_affine_disparity_iter_24000). -->
	- [CVPR'2024 paper](https://hf.co/papers/2312.02145) titled "Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation"
	- [Journal extension](https://hf.co/papers/2505.09358) titled "Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis"

	### Using the model
	- This model is for internal test.
	- Get to the bottom of things with our [official codebase](https://github.com/prs-eth/marigold).
	- Developed by: [Bingxin Ke](http://www.kebingxin.com/), [Anton Obukhov](https://www.obukhov.ai/), [Shengyu Huang](https://shengyuh.github.io/), [Nando Metzger](https://nandometzger.github.io/), [Rodrigo Caye Daudt](https://rcdaudt.github.io/), [Konrad Schindler](https://scholar.google.com/citations?user=FZuNgqIAAAAJ).
	- Model type: Generative latent diffusion-based affine-invariant disparity (inverse depth) estimation from a single image.
	- Language: English.
	- License: [Apache License License Version 2.0](https://www.apache.org/licenses/LICENSE-2.0).
	- Cite as:

	```bibtex
	@misc{ke2025marigold,
	title={Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis},
	author={Bingxin Ke and Kevin Qu and Tianfu Wang and Nando Metzger and Shengyu Huang and Bo Li and Anton Obukhov and Konrad Schindler},
	year={2025},
	eprint={2505.09358},
	archivePrefix={arXiv},
	primaryClass={cs.CV}
	}

	@InProceedings{ke2023repurposing,
	title={Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation},
	author={Bingxin Ke and Anton Obukhov and Shengyu Huang and Nando Metzger and Rodrigo Caye Daudt and Konrad Schindler},
	booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
	year={2024}
	}
	```