The base of this model is Qwen2.5-3B-Instruct, using TopiOCQA as the training data, and the training method is ConvSearch-R1.

The code is available here. Please refer to the paper here.

Downloads last month
40
Safetensors
Model size
3.09B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for BeastyZ/Qwen2.5-3B-ConvSearch-R1-TopiOCQA

Base model

Qwen/Qwen2.5-3B
Finetuned
(630)
this model