yliu-cs
/

SSR-VLM-7B

Model card Files Files and versions Community

SSR-VLM-7B / README.md

merve's picture

merve HF Staff

Fill out metadata and add explanation

f12f793 verified 10 days ago

|

307 Bytes

metadata

base_model:
  - Qwen/Qwen2.5-7B-Instruct
pipeline_tag: image-text-to-text
tags:
  - depth-estimation

SSR-MIDI-7B

This model repository is for the models in the paper SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning.