iwiwi commited on
Commit
e6e70a6
·
verified ·
1 Parent(s): e250fea

Add a link to SmolSwallow-1.5B-Instruct

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -14,6 +14,8 @@ base_model:
14
  **SmolSwallow-1.5B** is a Japanese compact language model created through TAID (Temporally Adaptive Interpolated Distillation), our new knowledge distillation method.
15
  We used [Qwen2.5-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-32B-Instruct) as the teacher model and [Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct) as the student model.
16
  The model has been further pre-trained on Japanese text data to enhance its Japanese language capabilities.
 
 
17
 
18
  ## Model Details
19
 
 
14
  **SmolSwallow-1.5B** is a Japanese compact language model created through TAID (Temporally Adaptive Interpolated Distillation), our new knowledge distillation method.
15
  We used [Qwen2.5-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-32B-Instruct) as the teacher model and [Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct) as the student model.
16
  The model has been further pre-trained on Japanese text data to enhance its Japanese language capabilities.
17
+
18
+ If you are looking for an instruction-following model, check [SmolSwallow-1.5B-Instruct](https://huggingface.co/SakanaAI/SmolSwallow-1.5B-Instruct).
19
 
20
  ## Model Details
21