nonemonehpark committed on
Commit 22637f1 · verified · 1 Parent(s): 509df9a

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +123 -0
README.md ADDED
@@ -0,0 +1,123 @@
+ ---
+ language:
+ - en
+ - ko
+ library_name: transformers
+ license: cc-by-nc-4.0
+ pipeline_tag: text-generation
+ model_id: kakaocorp/kanana-nano-2.1b-base
+ repo: kakaocorp/kanana-nano-2.1b-base
+ developers: Kanana LLM
+ training_regime: bf16 mixed precision
+ tags:
+ - llama-cpp
+ - gguf-my-repo
+ base_model: kakaocorp/kanana-nano-2.1b-base
+ model-index:
+ - name: kanana-nano-2.1b-base
+   results:
+   - task:
+       type: multiple_choice
+       name: mmlu
+     dataset:
+       name: mmlu (5-shots)
+       type: hails/mmlu_no_train
+     metrics:
+     - type: acc
+       value: 54.83
+       name: acc
+   - task:
+       type: generate_until
+       name: kmmlu
+     dataset:
+       name: kmmlu-direct (5-shots)
+       type: HAERAE-HUB/KMMLU
+     metrics:
+     - type: exact_match
+       value: 44.83
+       name: exact_match
+   - task:
+       type: multiple_choice
+       name: haerae
+     dataset:
+       name: haerae (5-shots)
+       type: HAERAE-HUB/HAE_RAE_BENCH
+     metrics:
+     - type: acc_norm
+       value: 77.09
+       name: acc_norm
+   - task:
+       type: generate_until
+       name: gsm8k
+     dataset:
+       name: gsm8k (5-shots)
+       type: openai/gsm8k
+     metrics:
+     - type: exact_match
+       value: 46.32
+       name: exact_match_strict
+   - task:
+       type: generate_until
+       name: humaneval
+     dataset:
+       name: humaneval (0-shots)
+       type: openai/openai_humaneval
+     metrics:
+     - type: pass@1
+       value: 31.1
+       name: pass@1
+   - task:
+       type: generate_until
+       name: mbpp
+     dataset:
+       name: mbpp (3-shots)
+       type: google-research-datasets/mbpp
+     metrics:
+     - type: pass@1
+       value: 46.2
+       name: pass@1
+ ---
+
+ # nonemonehpark/kanana-nano-2.1b-base-Q8_0-GGUF
+ This model was converted to GGUF format from [`kakaocorp/kanana-nano-2.1b-base`](https://huggingface.co/kakaocorp/kanana-nano-2.1b-base) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
+ Refer to the [original model card](https://huggingface.co/kakaocorp/kanana-nano-2.1b-base) for more details on the model.
+
+ ## Use with llama.cpp
+ Install llama.cpp through brew (works on macOS and Linux):
+
+ ```bash
+ brew install llama.cpp
+ ```
+ Invoke the llama.cpp server or the CLI.
+
+ ### CLI:
+ ```bash
+ llama-cli --hf-repo nonemonehpark/kanana-nano-2.1b-base-Q8_0-GGUF --hf-file kanana-nano-2.1b-base-q8_0.gguf -p "The meaning to life and the universe is"
+ ```
+
+ ### Server:
+ ```bash
+ llama-server --hf-repo nonemonehpark/kanana-nano-2.1b-base-Q8_0-GGUF --hf-file kanana-nano-2.1b-base-q8_0.gguf -c 2048
+ ```
+
+ Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the llama.cpp repo.
+
+ Step 1: Clone llama.cpp from GitHub.
+ ```bash
+ git clone https://github.com/ggerganov/llama.cpp
+ ```
+
+ Step 2: Move into the llama.cpp folder and build it with the `LLAMA_CURL=1` flag, along with any hardware-specific flags (for example, `LLAMA_CUDA=1` for NVIDIA GPUs on Linux).
+ ```bash
+ cd llama.cpp && LLAMA_CURL=1 make
+ ```
+
+ Step 3: Run inference through the main binary.
+ ```bash
+ ./llama-cli --hf-repo nonemonehpark/kanana-nano-2.1b-base-Q8_0-GGUF --hf-file kanana-nano-2.1b-base-q8_0.gguf -p "The meaning to life and the universe is"
+ ```
+ or
+ ```bash
+ ./llama-server --hf-repo nonemonehpark/kanana-nano-2.1b-base-Q8_0-GGUF --hf-file kanana-nano-2.1b-base-q8_0.gguf -c 2048
+ ```
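
Once `llama-server` is running as described in the README above, it can be queried over HTTP. The following is a minimal sketch and is not part of the committed README. It assumes the server is reachable at `http://localhost:8080` (llama-server's default address) and that the request goes to its `/completion` endpoint with the `prompt` and `n_predict` fields. The helper names `json_escape`, `build_payload`, and `query_server`, and the `LLAMA_SERVER_URL` variable, are hypothetical names introduced for illustration.

```bash
#!/usr/bin/env bash
# Sketch: helpers for querying a running llama-server over HTTP.
# Assumptions (not from the README): the server listens on localhost:8080
# unless LLAMA_SERVER_URL is set, and curl and sed are installed.

# Escape backslashes and double quotes so the prompt is a valid JSON string.
# Newlines and other control characters are not handled in this sketch.
json_escape() {
  printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g'
}

# Build the JSON request body: the prompt text and the number of tokens to generate.
build_payload() {
  printf '{"prompt": "%s", "n_predict": %d}' "$(json_escape "$1")" "$2"
}

# POST the payload to the /completion endpoint and print the raw JSON response.
# The second argument (token count) defaults to 64.
query_server() {
  curl -s "${LLAMA_SERVER_URL:-http://localhost:8080}/completion" \
    -H "Content-Type: application/json" \
    -d "$(build_payload "$1" "${2:-64}")"
}
```

After sourcing these functions, `query_server "The meaning to life and the universe is" 32` should print the server's JSON response, with the generated text in its `content` field. Because this is a base model rather than an instruction-tuned one, a plain text-continuation prompt like this is the natural way to call it.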