cjvt
/

dvres committed on
Commit 7144b02 · verified · 1 Parent(s): 2bc0fd3

Update README.md

  1. README.md +52 -2
README.md CHANGED
@@ -5,8 +5,58 @@ tags: []
5
 
6
  # Model Card for Model ID
7
 
8
- <!-- Provide a quick summary of what the model is/does. -->
9
-
10
 
11
 
12
  ## Model Details
 
5
 
6
  # Model Card for Model ID
7
 
8
+ GaMS-9B after the second round of training, which used high-quality Slovene corpora. Training was done with NeMo 2.0. EOS tokens were added to fill the batches.
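The card does not say how the EOS token was used to fill the batches. One possible reading is that shorter sequences were padded with EOS up to a common length. The sketch below illustrates only that reading; the function name `fill_with_eos` and the token ids are hypothetical, and this is not the NeMo 2.0 code path.

```python
from typing import List


def fill_with_eos(sequences: List[List[int]], eos_id: int) -> List[List[int]]:
    """Pad every sequence to the longest one in the batch using eos_id.

    Assumption: "filling the batch" means right-padding with EOS.
    The model card does not confirm this.
    """
    if not sequences:
        return []
    target_len = max(len(seq) for seq in sequences)
    # Append as many EOS ids as each sequence is short of the target length.
    return [seq + [eos_id] * (target_len - len(seq)) for seq in sequences]


# Toy token ids; eos_id=1 is an arbitrary choice for the example.
batch = fill_with_eos([[5, 6, 7], [8]], eos_id=1)
print(batch)  # [[5, 6, 7], [8, 1, 1]]
```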
9
+
10
+ Starting model: GaMS-Beta/GaMS-Parallel-2.0
11
+
12
+ ## Data
13
+
14
+ | Corpus | Language | # Tokens | Percentage |
15
+ | :----- | :------- | :------: | :--------: |
16
+ | KAS | Slovene | 2.77 B | 20.34 % |
17
+ | Metafida | Slovene | 4.66 B | 34.18 % |
18
+ | Wikipedia-En | English | 5.45 B | 39.99 % |
19
+ | Wikipedia-Sl | Slovene | 0.16 B | 1.19 % |
20
+ | Wikipedia-Hr | Croatian | 0.15 B | 1.13 % |
21
+ | Wikipedia-Bs | Bosnian | 0.07 B | 0.50 % |
22
+ | Wikipedia-Sr-Latin | Serbian | 0.36 B | 2.68 % |
23
+ | Total | | 13.62 B | |
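The percentage column can be cross-checked against the token counts. The sketch below recomputes each share from the rounded counts in the table. Small deviations from the reported values (a few hundredths of a percentage point) are expected, presumably because the reported percentages were derived from unrounded counts; that explanation is an assumption.

```python
# Token counts in billions, copied from the table.
token_counts_b = {
    "KAS": 2.77,
    "Metafida": 4.66,
    "Wikipedia-En": 5.45,
    "Wikipedia-Sl": 0.16,
    "Wikipedia-Hr": 0.15,
    "Wikipedia-Bs": 0.07,
    "Wikipedia-Sr-Latin": 0.36,
}

# Percentages as reported in the table, used only for comparison.
reported_pct = {
    "KAS": 20.34,
    "Metafida": 34.18,
    "Wikipedia-En": 39.99,
    "Wikipedia-Sl": 1.19,
    "Wikipedia-Hr": 1.13,
    "Wikipedia-Bs": 0.50,
    "Wikipedia-Sr-Latin": 2.68,
}

total_b = round(sum(token_counts_b.values()), 2)

# Share of each corpus, recomputed from the rounded counts.
shares = {name: 100 * count / total_b for name, count in token_counts_b.items()}

for name, share in shares.items():
    diff = share - reported_pct[name]
    print(f"{name:<20} {share:6.2f} %  (reported {reported_pct[name]:5.2f} %, diff {diff:+.2f})")
print(f"{'Total':<20} {total_b:.2f} B")
```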
24
+
25
+ ## Slovenian-LLM-Eval results
26
+
27
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/652d40a78fa1fbb0aae165bb/YszAknPaoxiBR0c_7Gu3U.png)
28
+
29
+ ## SloBench results
30
+
31
+ The reported results were obtained using guided decoding.
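The card does not specify which guided decoding implementation was used. As a rough illustration of the idea, the sketch below restricts the answer to a fixed set of allowed labels and picks the one the model scores highest. The names `guided_choice` and `toy_log_prob`, and the scores themselves, are hypothetical stand-ins; a real setup would query the language model and typically constrains generation token by token.

```python
import math
from typing import Callable, Sequence


def guided_choice(
    prompt: str,
    choices: Sequence[str],
    log_prob: Callable[[str, str], float],
) -> str:
    """Return the allowed choice with the highest log-probability for the prompt."""
    if not choices:
        raise ValueError("choices must not be empty")
    # Only the allowed choices are scored, so any other output is impossible.
    return max(choices, key=lambda choice: log_prob(prompt, choice))


def toy_log_prob(prompt: str, continuation: str) -> float:
    """Hypothetical scorer standing in for a real language model."""
    scores = {"da": -0.4, "ne": -1.2, "mogoče": -0.1}
    return scores.get(continuation, -math.inf)


# "mogoče" has the best raw score but is not an allowed label,
# so the guided choice falls back to the best allowed one.
answer = guided_choice("Ali je trditev resnična?", ["da", "ne"], toy_log_prob)
print(answer)  # da
```

Without the restriction, the free-form favourite ("mogoče") would count as a wrong or unparseable answer, which is why guided decoding tends to matter most for base models that do not reliably follow an answer format.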
32
+
33
+ ### 0-shot results
34
+
35
+ | | Model | BoolQ_accuracy | MultiRC_exact_match | MultiRC_per_question_f1 | MultiRC_f1_over_all_answers | WSC_accuracy | COPA_accuracy | RTE_accuracy | CB_accuracy | CB_f1 | NLI_accuracy | NLI_precision_entailment | NLI_recall_entailment | NLI_f1_entailment | NLI_precision_neutral | NLI_recall_neutral | NLI_f1_neutral | NLI_precision_contradiction | NLI_recall_contradiction | NLI_f1_contradiction |
36
+ |--:|:---------------------------------------|:------------------|:----------------------|:--------------------------|:------------------------------|:------------------|:------------------|:------------------|:------------------|:------------------|:------------------|:---------------------------|:------------------------|:--------------------|:------------------------|:---------------------|:------------------|:------------------------------|:---------------------------|:-----------------------|
37
+ | 0 | /models/hf_models/GaMS-9B-Parallel-2.0 | 0.76 [0.74, 0.77] | 0.21 [0.19, 0.24] | 0.56 [0.54, 0.58] | 0.55 [0.53, 0.57] | 0.38 [0.28, 0.47] | 0.6 [0.5, 0.7] | 0.6 [0.54, 0.66] | 0.68 [0.55, 0.8] | 0.59 [0.39, 0.76] | 0.35 [0.31, 0.39] | 0.5 [0.23, 0.77] | 0.04 [0.02, 0.07] | 0.08 [0.03, 0.13] | 0.33 [0.29, 0.37] | 0.85 [0.79, 0.9] | 0.47 [0.42, 0.52] | 0.41 [0.31, 0.52] | 0.19 [0.13, 0.24] | 0.26 [0.18, 0.32] |
38
+ | 1 | google/gemma-2-9b | 0.78 [0.77, 0.8] | 0.2 [0.18, 0.23] | 0.48 [0.45, 0.5] | 0.49 [0.48, 0.51] | 0.63 [0.54, 0.73] | 0.68 [0.59, 0.77] | 0.69 [0.63, 0.74] | 0.55 [0.42, 0.69] | 0.31 [0.22, 0.41] | 0.33 [0.29, 0.37] | 0.55 [0.25, 0.83] | 0.03 [0.01, 0.06] | 0.06 [0.02, 0.11] | 0.32 [0.28, 0.36] | 0.98 [0.95, 0.99] | 0.48 [0.44, 0.52] | 0.57 [0.14, 1.0] | 0.02 [0.01, 0.04] | 0.04 [0.01, 0.08] |
39
+ | 2 | google/gemma-2-9b-it | 0.83 [0.82, 0.84] | 0.18 [0.16, 0.2] | 0.6 [0.58, 0.62] | 0.5 [0.49, 0.52] | 0.62 [0.52, 0.71] | 0.86 [0.79, 0.93] | 0.8 [0.75, 0.85] | 0.82 [0.72, 0.92] | 0.73 [0.58, 0.85] | 0.47 [0.43, 0.51] | 0.48 [0.28, 0.68] | 0.06 [0.03, 0.1] | 0.11 [0.05, 0.17] | 0.4 [0.35, 0.46] | 0.66 [0.59, 0.74] | 0.5 [0.44, 0.56] | 0.55 [0.49, 0.62] | 0.72 [0.66, 0.78] | 0.62 [0.57, 0.68] |
40
+ | 3 | zlatorog/Zlatorog_SFT_v2 | 0.82 [0.81, 0.84] | 0.35 [0.32, 0.38] | 0.72 [0.7, 0.74] | 0.71 [0.69, 0.72] | 0.65 [0.56, 0.75] | 0.79 [0.71, 0.87] | 0.79 [0.74, 0.84] | 0.77 [0.65, 0.88] | 0.63 [0.46, 0.82] | 0.63 [0.59, 0.67] | 0.54 [0.48, 0.59] | 0.93 [0.9, 0.96] | 0.68 [0.63, 0.73] | 0.8 [0.25, 1.0] | 0.02 [0.01, 0.05] | 0.04 [0.01, 0.1] | 0.79 [0.73, 0.84] | 0.9 [0.86, 0.94] | 0.84 [0.8, 0.88] |
41
+ | 4 | cjvt/GaMS-1B | 0.52 [0.5, 0.54] | 0.01 [0.0, 0.01] | 0.05 [0.04, 0.05] | 0.04 [0.04, 0.05] | 0.61 [0.51, 0.7] | 0.55 [0.45, 0.65] | 0.48 [0.42, 0.54] | 0.21 [0.1, 0.33] | 0.19 [0.09, 0.27] | 0.33 [0.29, 0.36] | 0.34 [0.28, 0.4] | 0.43 [0.36, 0.5] | 0.38 [0.32, 0.44] | 0.33 [0.25, 0.41] | 0.26 [0.2, 0.33] | 0.29 [0.22, 0.35] | 0.3 [0.23, 0.37] | 0.28 [0.21, 0.34] | 0.29 [0.22, 0.35] |
42
+ | 5 | cjvt/GaMS-1B-Chat | 0.62 [0.61, 0.64] | 0.03 [0.02, 0.04] | 0.13 [0.12, 0.14] | 0.07 [0.07, 0.08] | 0.5 [0.4, 0.6] | 0.55 [0.45, 0.65] | 0.47 [0.41, 0.53] | 0.5 [0.36, 0.64] | 0.22 [0.18, 0.26] | 0.35 [0.31, 0.39] | 0.35 [0.31, 0.39] | 0.99 [0.98, 1.0] | 0.52 [0.47, 0.56] | 0.0 [0.0, 0.0] | 0.0 [0.0, 0.0] | 0.0 [0.0, 0.0] | 0.0 [0.0, 0.0] | 0.0 [0.0, 0.0] | 0.0 [0.0, 0.0] |
43
+ | 6 | utter-project/EuroLLM-9B | 0.77 [0.76, 0.79] | 0.12 [0.1, 0.14] | 0.53 [0.52, 0.55] | 0.53 [0.52, 0.55] | 0.55 [0.45, 0.65] | 0.78 [0.7, 0.86] | 0.73 [0.68, 0.79] | 0.79 [0.67, 0.9] | 0.55 [0.48, 0.62] | 0.39 [0.34, 0.43] | 0.64 [0.33, 1.0] | 0.04 [0.01, 0.07] | 0.07 [0.02, 0.12] | 0.34 [0.3, 0.38] | 0.95 [0.92, 0.98] | 0.5 [0.45, 0.54] | 0.87 [0.76, 0.96] | 0.22 [0.15, 0.28] | 0.35 [0.26, 0.42] |
44
+ | 7 | utter-project/EuroLLM-9B-Instruct | 0.81 [0.79, 0.82] | 0.18 [0.15, 0.2] | 0.64 [0.62, 0.65] | 0.64 [0.62, 0.66] | 0.6 [0.5, 0.69] | 0.58 [0.48, 0.68] | 0.82 [0.77, 0.87] | 0.77 [0.65, 0.88] | 0.63 [0.46, 0.82] | 0.38 [0.34, 0.42] | 0.46 [0.41, 0.52] | 0.76 [0.69, 0.82] | 0.57 [0.52, 0.62] | 0.24 [0.18, 0.3] | 0.21 [0.15, 0.26] | 0.22 [0.16, 0.28] | 0.31 [0.21, 0.42] | 0.14 [0.09, 0.2] | 0.2 [0.13, 0.26] |
45
+ | 8 | /models/hf_models/GaMS-9B-SecondRound | 0.78 [0.76, 0.79] | 0.23 [0.21, 0.26] | 0.63 [0.61, 0.65] | 0.53 [0.52, 0.55] | 0.59 [0.49, 0.68] | 0.65 [0.55, 0.75] | 0.77 [0.72, 0.82] | 0.68 [0.55, 0.8] | 0.64 [0.52, 0.76] | 0.46 [0.42, 0.5] | 0.49 [0.42, 0.57] | 0.45 [0.38, 0.52] | 0.47 [0.41, 0.53] | 0.33 [0.25, 0.4] | 0.29 [0.23, 0.36] | 0.31 [0.24, 0.37] | 0.53 [0.46, 0.6] | 0.62 [0.56, 0.7] | 0.57 [0.51, 0.63] |
46
+
47
+ ### 3-shot results
48
+
49
+ | | Model | BoolQ_accuracy | MultiRC_exact_match | MultiRC_per_question_f1 | MultiRC_f1_over_all_answers | WSC_accuracy | COPA_accuracy | RTE_accuracy | CB_accuracy | CB_f1 | NLI_accuracy | NLI_precision_entailment | NLI_recall_entailment | NLI_f1_entailment | NLI_precision_neutral | NLI_recall_neutral | NLI_f1_neutral | NLI_precision_contradiction | NLI_recall_contradiction | NLI_f1_contradiction |
50
+ |--:|:---------------------------------------|:------------------|:----------------------|:--------------------------|:------------------------------|:------------------|:------------------|:------------------|:------------------|:------------------|:------------------|:---------------------------|:------------------------|:--------------------|:------------------------|:---------------------|:------------------|:------------------------------|:---------------------------|:-----------------------|
51
+ | 0 | /models/hf_models/GaMS-9B-Parallel-2.0 | 0.83 [0.81, 0.84] | 0.36 [0.33, 0.39] | 0.74 [0.72, 0.75] | 0.74 [0.73, 0.76] | 0.64 [0.55, 0.74] | 0.87 [0.8, 0.94] | 0.78 [0.73, 0.83] | 0.84 [0.74, 0.94] | 0.59 [0.53, 0.64] | 0.48 [0.43, 0.52] | 0.38 [0.21, 0.56] | 0.07 [0.03, 0.11] | 0.11 [0.06, 0.18] | 0.37 [0.32, 0.42] | 0.67 [0.6, 0.74] | 0.48 [0.42, 0.53] | 0.66 [0.59, 0.73] | 0.72 [0.66, 0.78] | 0.69 [0.63, 0.75] |
52
+ | 1 | google/gemma-2-9b | 0.82 [0.81, 0.83] | 0.37 [0.34, 0.4] | 0.75 [0.73, 0.77] | 0.75 [0.74, 0.77] | 0.66 [0.57, 0.76] | 0.88 [0.82, 0.94] | 0.79 [0.74, 0.84] | 0.88 [0.79, 0.96] | 0.61 [0.56, 0.65] | 0.48 [0.44, 0.52] | 0.52 [0.37, 0.65] | 0.15 [0.1, 0.19] | 0.23 [0.15, 0.29] | 0.37 [0.32, 0.42] | 0.77 [0.71, 0.83] | 0.5 [0.45, 0.55] | 0.75 [0.68, 0.82] | 0.56 [0.48, 0.63] | 0.64 [0.57, 0.71] |
53
+ | 2 | google/gemma-2-9b-it | 0.84 [0.83, 0.85] | 0.15 [0.13, 0.18] | 0.66 [0.64, 0.67] | 0.65 [0.64, 0.67] | 0.7 [0.61, 0.79] | 0.89 [0.83, 0.95] | 0.82 [0.78, 0.87] | 0.84 [0.74, 0.94] | 0.75 [0.59, 0.88] | 0.65 [0.61, 0.69] | 0.66 [0.6, 0.72] | 0.77 [0.71, 0.83] | 0.71 [0.66, 0.76] | 0.56 [0.46, 0.68] | 0.28 [0.22, 0.35] | 0.37 [0.3, 0.45] | 0.68 [0.61, 0.74] | 0.88 [0.83, 0.92] | 0.76 [0.72, 0.81] |
54
+ | 3 | zlatorog/Zlatorog_SFT_v2 | 0.83 [0.82, 0.84] | 0.04 [0.02, 0.05] | 0.54 [0.52, 0.55] | 0.55 [0.53, 0.56] | 0.51 [0.41, 0.61] | 0.66 [0.57, 0.75] | 0.73 [0.67, 0.78] | 0.7 [0.57, 0.82] | 0.48 [0.4, 0.56] | 0.56 [0.52, 0.6] | 0.55 [0.49, 0.62] | 0.62 [0.55, 0.69] | 0.58 [0.53, 0.64] | 0.39 [0.31, 0.49] | 0.24 [0.19, 0.31] | 0.3 [0.23, 0.37] | 0.64 [0.58, 0.71] | 0.79 [0.73, 0.85] | 0.71 [0.66, 0.76] |
55
+ | 4 | cjvt/GaMS-1B | 0.49 [0.47, 0.5] | 0.07 [0.06, 0.09] | 0.41 [0.39, 0.43] | 0.37 [0.35, 0.39] | 0.58 [0.48, 0.67] | 0.49 [0.39, 0.59] | 0.47 [0.41, 0.53] | 0.43 [0.29, 0.56] | 0.21 [0.17, 0.25] | 0.32 [0.28, 0.36] | 0.36 [0.27, 0.44] | 0.22 [0.16, 0.29] | 0.27 [0.21, 0.34] | 0.31 [0.27, 0.36] | 0.77 [0.7, 0.83] | 0.44 [0.39, 0.49] | 0.0 [0.0, 0.0] | 0.0 [0.0, 0.0] | 0.0 [0.0, 0.0] |
56
+ | 5 | cjvt/GaMS-1B-Chat | 0.59 [0.57, 0.61] | 0.04 [0.02, 0.05] | 0.34 [0.32, 0.36] | 0.16 [0.15, 0.16] | 0.63 [0.54, 0.73] | 0.55 [0.45, 0.65] | 0.47 [0.41, 0.53] | 0.43 [0.29, 0.56] | 0.2 [0.16, 0.24] | 0.36 [0.32, 0.4] | 0.36 [0.32, 0.4] | 0.99 [0.97, 1.0] | 0.53 [0.48, 0.57] | 0.47 [0.2, 0.73] | 0.04 [0.01, 0.07] | 0.07 [0.02, 0.13] | 0.0 [0.0, 0.0] | 0.0 [0.0, 0.0] | 0.0 [0.0, 0.0] |
57
+ | 6 | utter-project/EuroLLM-9B | 0.81 [0.79, 0.82] | 0.17 [0.15, 0.2] | 0.61 [0.59, 0.63] | 0.58 [0.57, 0.6] | 0.65 [0.56, 0.75] | 0.63 [0.53, 0.73] | 0.73 [0.67, 0.78] | 0.73 [0.61, 0.85] | 0.59 [0.47, 0.71] | 0.45 [0.41, 0.49] | 1.0 [0.0, 1.0] | 0.02 [0.0, 0.04] | 0.03 [0.0, 0.07] | 0.38 [0.32, 0.45] | 0.53 [0.46, 0.61] | 0.44 [0.38, 0.51] | 0.49 [0.44, 0.55] | 0.83 [0.78, 0.88] | 0.62 [0.57, 0.67] |
58
+ | 7 | utter-project/EuroLLM-9B-Instruct | 0.81 [0.79, 0.82] | 0.07 [0.05, 0.08] | 0.52 [0.5, 0.53] | 0.53 [0.51, 0.55] | 0.65 [0.56, 0.75] | 0.69 [0.6, 0.78] | 0.79 [0.74, 0.84] | 0.79 [0.67, 0.9] | 0.71 [0.56, 0.84] | 0.42 [0.38, 0.47] | 0.61 [0.53, 0.69] | 0.48 [0.41, 0.55] | 0.53 [0.47, 0.6] | 0.31 [0.26, 0.36] | 0.57 [0.5, 0.64] | 0.4 [0.35, 0.45] | 0.54 [0.43, 0.65] | 0.23 [0.17, 0.29] | 0.32 [0.25, 0.39] |
59
+ | 8 | /models/hf_models/GaMS-9B-SecondRound | 0.8 [0.78, 0.81] | 0.15 [0.13, 0.17] | 0.64 [0.62, 0.65] | 0.58 [0.57, 0.6] | 0.66 [0.57, 0.76] | 0.9 [0.84, 0.96] | 0.82 [0.77, 0.87] | 0.84 [0.74, 0.94] | 0.73 [0.56, 0.87] | 0.56 [0.52, 0.6] | 0.58 [0.5, 0.66] | 0.38 [0.31, 0.46] | 0.46 [0.39, 0.53] | 0.42 [0.35, 0.5] | 0.44 [0.37, 0.51] | 0.43 [0.37, 0.49] | 0.65 [0.59, 0.71] | 0.86 [0.8, 0.91] | 0.74 [0.69, 0.79] |
60
 
61
 
62
  ## Model Details