Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
BryanADA
/
Qwen2.5-3B-cot-zh-tw
like
1
Text Generation
Transformers
GGUF
DoggiAI/GSM8K_zh_tw
Chinese
English
qwen2
text-generation-inference
chain-of-thought
qwen
traditional-chinese
reasoning
rlhf
grpo
cot
local-deploy
conversational
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
BryanADA
commited on
May 23
Commit
c331152
·
verified
·
1 Parent(s):
98bbc32
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+1
-1
README.md
CHANGED
Viewed
@@ -24,7 +24,7 @@ base_model:
24
- Qwen/Qwen2.5-3B-Instruct
25
---
26
27
-
# Qwen-2.5-3B-CoT-ZH-TW (GRPO
RLHF 啟發式多步推理優化版
)
28
29
30
---
24
- Qwen/Qwen2.5-3B-Instruct
25
---
26
27
+
# Qwen-2.5-3B-CoT-ZH-TW (GRPO)
28
29
30
---