Update README.md
Browse files
README.md
CHANGED
@@ -22,14 +22,14 @@ Chain-of-thought (CoT) reasoning greatly improves the interpretability and probl
|
|
22 |
|
23 |
UV-CoT achieves this by performing preference comparisons between model-generated bounding boxes. It generates preference data automatically, then uses an evaluator MLLM (e.g., OmniLLM-12B) to rank responses, which serves as supervision to train the target MLLM (e.g., LLaVA-1.5-7B). This approach emulates human perception—identifying key regions and reasoning based on them—thereby improving visual comprehension, particularly in spatial reasoning tasks.
|
24 |
|
25 |
-
 to rank responses, which serves as supervision to train the target MLLM (e.g., LLaVA-1.5-7B). This approach emulates human perception—identifying key regions and reasoning based on them—thereby improving visual comprehension, particularly in spatial reasoning tasks.
|
24 |
|
25 |
+

|
26 |
|
27 |
## Visualizations
|
28 |
|
29 |
Qualitative examples demonstrating UV-CoT's visual reasoning:
|
30 |
|
31 |
+

|
32 |
+

|
33 |
|
34 |
## Installation
|
35 |
|