Add a link to SmolSwallow-1.5B-Instruct
Browse files
README.md
CHANGED
@@ -14,6 +14,8 @@ base_model:
|
|
14 |
**SmolSwallow-1.5B** is a Japanese compact language model created through TAID (Temporally Adaptive Interpolated Distillation), our new knowledge distillation method.
|
15 |
We used [Qwen2.5-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-32B-Instruct) as the teacher model and [Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct) as the student model.
|
16 |
The model has been further pre-trained on Japanese text data to enhance its Japanese language capabilities.
|
|
|
|
|
17 |
|
18 |
## Model Details
|
19 |
|
|
|
14 |
**SmolSwallow-1.5B** is a Japanese compact language model created through TAID (Temporally Adaptive Interpolated Distillation), our new knowledge distillation method.
|
15 |
We used [Qwen2.5-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-32B-Instruct) as the teacher model and [Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct) as the student model.
|
16 |
The model has been further pre-trained on Japanese text data to enhance its Japanese language capabilities.
|
17 |
+
|
18 |
+
If you are looking for an instruction-following model, check [SmolSwallow-1.5B-Instruct](https://huggingface.co/SakanaAI/SmolSwallow-1.5B-Instruct).
|
19 |
|
20 |
## Model Details
|
21 |
|