eagle0504
/

qwen-2-5-3b-instruct-using-openai-gsm8k-gguf-data-enhanced-with-deepseek-v3-small

Model card Files Files and versions Community

eagle0504 commited on Mar 3

Commit

c43c4a1

·

verified ·

1 Parent(s): 9972876

Update README.md

Files changed (1) hide show

README.md +8 -1

README.md CHANGED Viewed

@@ -33,7 +33,14 @@ This model is a fine-tuned version of **Qwen/Qwen2.5-3B-Instruct**, optimized fo
 - **Enhancement**: After fine-tuning on GSM8K, additional reasoning layers were introduced using **DeepSeek-V3-Small**, leading to richer, more interpretable answers.
 - **Training Objective**: Improve step-by-step mathematical reasoning and **enhance logical deductions** in model-generated responses.
-## How to Use
 You can load this model with `transformers`:

 - **Enhancement**: After fine-tuning on GSM8K, additional reasoning layers were introduced using **DeepSeek-V3-Small**, leading to richer, more interpretable answers.
 - **Training Objective**: Improve step-by-step mathematical reasoning and **enhance logical deductions** in model-generated responses.
+I have adopted some code from Unsloth and here's an updated [notebook](https://colab.research.google.com/drive/1HV0YkyiTD55j1xLRBHwJ_q3ex82W5EXr?usp=sharing) on Colab. Please feel free to copy it and run it yourself.
+You will need:
+- Huggingface token
+- Together.AI API Key
+- Unsloth package
+## How to Use Model for Inference
 You can load this model with `transformers`: