prithivMLmods
/

Crux-Qwen3_OpenThinking-4B

Text Generation

text-generation-inference

Model card Files Files and versions

prithivMLmods commited on May 23

Commit

accd468

·

verified ·

1 Parent(s): d405de6

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -17,6 +17,8 @@ tags:
 - code
 ---
 # Crux-Qwen3\_OpenThinking-4B
 > **Crux-Qwen3\_OpenThinking-4B** is fine-tuned on the **Qwen3-4B** architecture, optimized for advanced **open thinking**, **mathematical reasoning**, and **logical problem solving**. This model is trained on the traces of **sk1.1**, which include 1,000 entries from the **Gemini thinking trajectory**, combined with fine-tuning on 100k tokens of **open math reasoning** data. This makes it highly effective for nuanced reasoning, educational tasks, and complex problem-solving requiring clear thought processes.
@@ -106,4 +108,4 @@ print(response)
 ## References
-1. [YaRN: Efficient Context Window Extension of Large Language Models](https://arxiv.org/pdf/2309.00071)

 - code
 ---
+![zdfbdccf.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/4XCMQEsE0mv2s5rx-YdIK.png)
 # Crux-Qwen3\_OpenThinking-4B
 > **Crux-Qwen3\_OpenThinking-4B** is fine-tuned on the **Qwen3-4B** architecture, optimized for advanced **open thinking**, **mathematical reasoning**, and **logical problem solving**. This model is trained on the traces of **sk1.1**, which include 1,000 entries from the **Gemini thinking trajectory**, combined with fine-tuning on 100k tokens of **open math reasoning** data. This makes it highly effective for nuanced reasoning, educational tasks, and complex problem-solving requiring clear thought processes.
 ## References
+1. [YaRN: Efficient Context Window Extension of Large Language Models](https://arxiv.org/pdf/2309.00071)