OPEA/SmallThinker-3B-Preview-int4-sym-gguf-q4-0-inc
Commit 574cfad (verified) · Parent: c1a45e8
cicdatopea committed: Update README.md

Files changed (1): README.md (+15 −1)
README.md CHANGED
````diff
@@ -146,8 +146,22 @@ text="Which number is bigger, 9.11 or 9.8?"
 
 pip3 install lm-eval==0.4.5
 
+Convert the gguf model to an hf model using the following script:
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+
+model_id = "OPEA/SmallThinker-3B-Preview-int4-sym-gguf-q4-0-inc"
+filename = "SmallThinker-3B-Preview-3.1B-Q4_0.gguf"
+
+tokenizer = AutoTokenizer.from_pretrained(model_id, gguf_file=filename)
+model = AutoModelForCausalLM.from_pretrained(model_id, gguf_file=filename)
+
+model.save_pretrained("SmallThinker-3B-Preview-w4g32-gguf-q4-0-to-hf")
+tokenizer.save_pretrained("SmallThinker-3B-Preview-w4g32-gguf-q4-0-to-hf")
+```
+
 ```bash
-auto-round --model "OPEA/Qwen2.5-7B-Instruct-int4-inc" --eval --eval_bs 16 --tasks leaderboard_ifeval,leaderboard_mmlu_pro,gsm8k,lambada_openai,hellaswag,piqa,winogrande,truthfulqa_mc1,openbookqa,boolq,arc_easy,arc_challenge,cmmlu,ceval-valid
+auto-round --model "SmallThinker-3B-Preview-w4g32-gguf-q4-0-to-hf" --eval --eval_bs 16 --tasks leaderboard_ifeval,leaderboard_mmlu_pro,gsm8k,lambada_openai,hellaswag,piqa,winogrande,truthfulqa_mc1,openbookqa,boolq,arc_easy,arc_challenge,cmmlu,mmlu,ceval-valid
 ```
 
 | Metric | BF16 | INT4 |
````
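
After the conversion step added above, the output directory is a plain Transformers checkpoint, so it can be smoke-tested with a single generation before launching the full `auto-round --eval` sweep. A minimal sketch, not part of the commit, assuming the `save_pretrained` directory produced by the script above; the prompt is the one from the surrounding README section:

```python
# Minimal sketch (not part of the commit): load the converted checkpoint and
# run one generation to confirm the GGUF -> HF conversion produced usable weights.
from transformers import AutoModelForCausalLM, AutoTokenizer

path = "SmallThinker-3B-Preview-w4g32-gguf-q4-0-to-hf"  # directory written by save_pretrained above
tokenizer = AutoTokenizer.from_pretrained(path)
model = AutoModelForCausalLM.from_pretrained(path)

# Prompt taken from the README section this hunk modifies.
inputs = tokenizer("Which number is bigger, 9.11 or 9.8?", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```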