PowerInfer
/

TurboSparse-Mixtral

Feature Extraction

turbosparsemixtral

Model card Files Files and versions Community

yixinsong commited on May 31, 2024

Commit

e917fa2

·

verified ·

1 Parent(s): 957776b

Update README.md

Files changed (1) hide show

README.md +8 -0

README.md CHANGED Viewed

@@ -14,6 +14,14 @@ The SuperSparse-Mixtral Large Language Model (LLM) is an sparsified version of t
 Our code for accelerating SuperSparse-Mixtral is currently being refined. Stay tuned! Now you can run this model like dense model.
 ## Allow Finetuning
 As we merged the predictors for FFN neurons in models, you can finetune SuperSparse-Mixtral with any framework and algorithm.

 Our code for accelerating SuperSparse-Mixtral is currently being refined. Stay tuned! Now you can run this model like dense model.
+## Chat-Template
+During sparsification, we also utilize some SFT datasets.
+We take ChatML as our chat template:
+```
+<|im_start|>user\n{{content}}<|im_end|>\n<|im_start|>assistant\n
+```
 ## Allow Finetuning
 As we merged the predictors for FFN neurons in models, you can finetune SuperSparse-Mixtral with any framework and algorithm.