Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
kolerk 's Collections
TON

TON

updated 26 days ago

Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models.

Upvote
1

  • kolerk/TON-3B-AITZ

    Image-Text-to-Text • Updated 26 days ago • 17

  • kolerk/TON-3B-CLEVR

    Image-Text-to-Text • Updated 26 days ago • 14

  • kolerk/TON-3B-Math

    Image-Text-to-Text • Updated 26 days ago • 14

  • kolerk/TON-7B-Math

    Image-Text-to-Text • Updated 26 days ago • 17

  • kolerk/TON-AITZ-SFT

    Preview • Updated 26 days ago • 82

  • kolerk/TON-Math-SFT

    Viewer • Updated 26 days ago • 8.03k • 120

  • Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models

    Paper • 2505.16854 • Published 27 days ago • 11
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs