license: apache-2.0 | |
datasets: | |
- kolerk/TON-Math-SFT | |
language: | |
- en | |
metrics: | |
- accuracy | |
base_model: | |
- Qwen/Qwen2.5-VL-3B-Instruct | |
pipeline_tag: image-text-to-text | |
This is the model cited in the paper: [Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models](https://arxiv.org/abs/2505.16854). |