yliu-cs
/

SSR-MIDI-7B

Model card Files Files and versions Community

SSR-MIDI-7B / README.md

merve's picture

merve HF Staff

Add pipeline tag and explanation

9fef08b verified 12 days ago

|

306 Bytes

	---
	base_model:
	- Qwen/Qwen2.5-7B-Instruct
	pipeline_tag: image-text-to-text
	tags:
	- depth-estimation
	---
	## SSR-MIDI-7B
	This model repository is for the models in the paper [SSR: Enhancing Depth Perception in Vision-Language
	Models via Rationale-Guided Spatial Reasoning](https://arxiv.org/abs/2505.12448).