TON-7B-Math / README.md

nielsr HF Staff

Add library_name and link to code

2d683fe verified about 1 month ago

preview code

raw

history blame

399 Bytes

metadata

base_model:
  - Qwen/Qwen2.5-VL-7B-Instruct
datasets:
  - kolerk/TON-Math-SFT
language:
  - en
license: apache-2.0
metrics:
  - accuracy
pipeline_tag: image-text-to-text
library_name: transformers

This is the model cited in the paper: Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models.

Code: https://github.com/kokolerk/TON