kolerk
/

TON-3B-Math

Image-Text-to-Text

Model card Files Files and versions Community

TON-3B-Math / README.md

kolerk's picture

Create README.md

754715b verified 27 days ago

|

history blame contribute delete

332 Bytes

	---
	license: apache-2.0
	datasets:
	- kolerk/TON-Math-SFT
	language:
	- en
	metrics:
	- accuracy
	base_model:
	- Qwen/Qwen2.5-VL-3B-Instruct
	pipeline_tag: image-text-to-text
	---
	This is the model cited in the paper: [Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models](https://arxiv.org/abs/2505.16854).