SSR-VLM-7B / README.md
merve's picture
merve HF Staff
Fill out metadata and add explanation
f12f793 verified
|
raw
history blame
307 Bytes
metadata
base_model:
  - Qwen/Qwen2.5-7B-Instruct
pipeline_tag: image-text-to-text
tags:
  - depth-estimation

SSR-MIDI-7B

This model repository is for the models in the paper SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning.