Upload folder using huggingface_hub
Browse files- .gitattributes +0 -3
- README.md +0 -5
.gitattributes
CHANGED
@@ -33,6 +33,3 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
-
assets/performance/kanana-1.5-radar.png filter=lfs diff=lfs merge=lfs -text
|
37 |
-
assets/performance/niah-32.5b-base.png filter=lfs diff=lfs merge=lfs -text
|
38 |
-
assets/performance/niah-32.5b-inst.png filter=lfs diff=lfs merge=lfs -text
|
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
README.md
CHANGED
@@ -52,11 +52,6 @@ training_regime: bf16 mixed precision
|
|
52 |
|
53 |
`Kanana 1.5`, a newly introduced version of the Kanana model family, presents substantial enhancements in **coding, mathematics, and function calling capabilities** over the previous version, enabling broader application to more complex real-world problems. This new version now can handle __up to 32K tokens length natively and up to 128K tokens using YaRN__, allowing the model to maintain coherence when handling extensive documents or engaging in extended conversations. Furthermore, Kanana 1.5 delivers more natural and accurate conversations through a __refined post-training process__.
|
54 |
|
55 |
-
<p align="center">
|
56 |
-
<picture>
|
57 |
-
<img src="assets/performance/kanana-1.5-radar.png", width="700" style="margin: 40px auto;">
|
58 |
-
</picture>
|
59 |
-
|
60 |
> [!Note]
|
61 |
> Neither the pre-training nor the post-training data includes Kakao user data.
|
62 |
|
|
|
52 |
|
53 |
`Kanana 1.5`, a newly introduced version of the Kanana model family, presents substantial enhancements in **coding, mathematics, and function calling capabilities** over the previous version, enabling broader application to more complex real-world problems. This new version now can handle __up to 32K tokens length natively and up to 128K tokens using YaRN__, allowing the model to maintain coherence when handling extensive documents or engaging in extended conversations. Furthermore, Kanana 1.5 delivers more natural and accurate conversations through a __refined post-training process__.
|
54 |
|
|
|
|
|
|
|
|
|
|
|
55 |
> [!Note]
|
56 |
> Neither the pre-training nor the post-training data includes Kakao user data.
|
57 |
|