Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ertghiu256
/
qwen-3-4b-mixture-of-thought
like
2
PyTorch
open-r1/Mixture-of-Thoughts
PSM24/gemini-2.5-pro-100x
qwen3
unsloth
trl
sft
cot
reasoning
think
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Model Info:
Usage:
Model Info:
A small qwen 3 model trained on 34000 data collected from open-r1/mixture-of-thought.
Usage:
Solve math
Generate codes
Thinking
Downloads last month
23
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for
ertghiu256/qwen-3-4b-mixture-of-thought
Base model
Qwen/Qwen3-4B-Base
Finetuned
Qwen/Qwen3-4B
Finetuned
(
101
)
this model
Datasets used to train
ertghiu256/qwen-3-4b-mixture-of-thought
open-r1/Mixture-of-Thoughts
Viewer
•
Updated
20 days ago
•
699k
•
35.4k
•
220
PSM24/gemini-2.5-pro-100x
Viewer
•
Updated
May 6
•
100
•
151
•
6
Collection including
ertghiu256/qwen-3-4b-mixture-of-thought
Qwen 3 4b Finetuned
Collection
6 items
•
Updated
1 day ago