JasperHaozhe commited on
Commit
477cc28
·
verified ·
1 Parent(s): ea66ff2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -1
README.md CHANGED
@@ -10,5 +10,10 @@ base_model:
10
  - Qwen/Qwen2.5-VL-7B-Instruct
11
  pipeline_tag: question-answering
12
  ---
 
13
 
14
- This is the model trained with https://github.com/TIGER-AI-Lab/Pixel-Reasoner/.
 
 
 
 
 
10
  - Qwen/Qwen2.5-VL-7B-Instruct
11
  pipeline_tag: question-answering
12
  ---
13
+ The model is trained with curiosity-driven RL described in [paper](https://arxiv.org/abs/2505.15966).
14
 
15
+ We have released vllm based inference code at https://github.com/TIGER-AI-Lab/Pixel-Reasoner/.
16
+
17
+ We will release a simple hf.generate() based inference code.
18
+
19
+ Please also play with the cool [interactive demo](https://huggingface.co/spaces/TIGER-Lab/Pixel-Reasoner)