kakaocorp
/

kanana-1.5-2.1b-base

Text Generation

text-generation-inference

Model card Files Files and versions Community

wavy-jung commited on May 22

Commit

c3b118a

·

verified ·

1 Parent(s): cca2b81

Upload folder using huggingface_hub

Files changed (2) hide show

.gitattributes +0 -3
README.md +0 -5

.gitattributes CHANGED Viewed

@@ -33,6 +33,3 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
-assets/performance/kanana-1.5-radar.png filter=lfs diff=lfs merge=lfs -text
-assets/performance/niah-32.5b-base.png filter=lfs diff=lfs merge=lfs -text
-assets/performance/niah-32.5b-inst.png filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

README.md CHANGED Viewed

@@ -52,11 +52,6 @@ training_regime: bf16 mixed precision
 `Kanana 1.5`, a newly introduced version of the Kanana model family, presents substantial enhancements in **coding, mathematics, and function calling capabilities** over the previous version, enabling broader application to more complex real-world problems. This new version now can handle __up to 32K tokens length natively and up to 128K tokens using YaRN__, allowing the model to maintain coherence when handling extensive documents or engaging in extended conversations. Furthermore, Kanana 1.5 delivers more natural and accurate conversations through a __refined post-training process__.
-<p align="center">
-<picture>
-    <img src="assets/performance/kanana-1.5-radar.png", width="700" style="margin: 40px auto;">
-</picture>
 > [!Note]
 > Neither the pre-training nor the post-training data includes Kakao user data.

 `Kanana 1.5`, a newly introduced version of the Kanana model family, presents substantial enhancements in **coding, mathematics, and function calling capabilities** over the previous version, enabling broader application to more complex real-world problems. This new version now can handle __up to 32K tokens length natively and up to 128K tokens using YaRN__, allowing the model to maintain coherence when handling extensive documents or engaging in extended conversations. Furthermore, Kanana 1.5 delivers more natural and accurate conversations through a __refined post-training process__.
 > [!Note]
 > Neither the pre-training nor the post-training data includes Kakao user data.