benchang1110/Qwen2.5-Taiwan-3B-Reason-SFT

benchang1110 committed on Mar 23

Commit 20fc8d2 · verified · 1 Parent(s): 0e0919d

Update README.md

Files changed (1)

README.md +7 -8

@@ -25,14 +25,6 @@ library_name: transformers
 **Note**: This model's tokenizer is the same as that of [deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B) (after Simplified-to-Traditional Chinese conversion) and differs from that of [benchang1110/Qwen2.5-Taiwan-3B-Instruct](https://huggingface.co/benchang1110/Qwen2.5-Taiwan-3B-Instruct).
 To generate Simplified Chinese, use the [deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B) tokenizer directly.
-### Choosing MATH Related Problems
-We simply check whether repo_name is math-related:
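-```python
-from datasets import load_dataset
-dataset = load_dataset('benchang1110/Chinese-DeepSeek-R1-Distill-data-110k-opencc', split='train')
-# Keep only the rows whose repo_name mentions "math" (case-insensitive).
-dataset = dataset.filter(lambda x: 'math' in x['repo_name'].lower())
-```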
 ## Model Description
@@ -61,6 +53,13 @@ GPU Hours: A100*15h
 ![REASON_SFT_3B.png](REASON_SFT_3B.png)
 ## Uses
 This model can be used to answer math problems; ```<think>``` is already included in the chat template.

 **Note**: This model's tokenizer is the same as that of [deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B) (after Simplified-to-Traditional Chinese conversion) and differs from that of [benchang1110/Qwen2.5-Taiwan-3B-Instruct](https://huggingface.co/benchang1110/Qwen2.5-Taiwan-3B-Instruct).
 To generate Simplified Chinese, use the [deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B) tokenizer directly.
 ## Model Description
 ![REASON_SFT_3B.png](REASON_SFT_3B.png)
+Fine-tuned on a math dataset:
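+```python
+from datasets import load_dataset
+dataset = load_dataset('benchang1110/Chinese-DeepSeek-R1-Distill-data-110k-opencc', split='train')
+# Keep only the rows whose repo_name mentions "math" (case-insensitive).
+dataset = dataset.filter(lambda x: 'math' in x['repo_name'].lower())
+```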
 ## Uses
 This model can be used to answer math problems; ```<think>``` is already included in the chat template.
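
Because the README states that `<think>` is already part of the chat template, generated text will probably begin with the reasoning and only later reach the final answer. The sketch below shows one way to separate the two. It rests on an assumption the README does not confirm: that the model closes its reasoning with a `</think>` tag, as DeepSeek-R1-style models usually do. The helper name `split_reasoning` is hypothetical and is not part of the model or of any library.

```python
from typing import Tuple


def split_reasoning(generated: str, close_tag: str = "</think>") -> Tuple[str, str]:
    """Split generated text into (reasoning, answer).

    Assumption: the chat template already emitted the opening <think>, so the
    continuation holds the reasoning, then close_tag, then the final answer.
    """
    reasoning, sep, answer = generated.partition(close_tag)
    if not sep:
        # No closing tag (for example, generation was cut off):
        # treat the whole text as reasoning and return an empty answer.
        return generated.strip(), ""
    # Drop one opening tag in case the decoded text still contains it.
    reasoning = reasoning.replace("<think>", "", 1)
    return reasoning.strip(), answer.strip()


reasoning, answer = split_reasoning("2 + 2 is 4.\n</think>\nThe answer is 4.")
print(answer)
```

If the decoded output uses a different closing marker, pass it through `close_tag`. When no marker is found, the function returns an empty answer instead of guessing where the reasoning ends.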