Improve model card with library name and primary paper link (#3)

- Improve model card with library name and primary paper link (2429b4407e5c2a8d9666c3f6d604ec7b61483206)

Co-authored-by: Niels Rogge <[email protected]>

Files changed (1)

README.md +24 -22

@@ -1,21 +1,23 @@
 ---
-language:
-- fi
-license: apache-2.0
-tags:
-- finnish
-- llama
 datasets:
 - Finnish-NLP/mc4_3.1.0_fi_cleaned
 - Finnish-NLP/oscar_2301_fi_cleaned
 - Finnish-NLP/Reddit_fi_2006_2022
 - Finnish-NLP/wikipedia_20230501_fi_cleaned
 - intfloat/multilingual_cc_news
-inference: false
 pipeline_tag: text-generation
 ---
 # Llama-3b for Finnish
 Pretrained Llama model on Finnish language using a causal language modeling (CLM) objective. Llama model was introduced in
@@ -106,18 +108,18 @@ The final training dataset had 19 billion words and the evaluation dataset had 2
 |Dataset                       | Words       | Ratio       |
 |------------------------------|-------------|-------------|
-|mc4_3.1.0_fi_cleaned          | 11.462B     | 60.7\%      |
-|oscar_2301_fi_cleaned         | 3.295B      | 17.4\%      |
-|Suomi24                       | 3.045B      | 16.1\%      |
-|multilingual_cc_news          | 0.295B      | 1.6\%       |
-|STT                           | 0.249B      | 1.3\%       |
-|Yle                           | 0.201B      | 1.1\%       |
-|Reddit_fi_2006_2022           | 0.138B      | 0.7\%       |
-|wikipedia_20230501_fi_cleaned | 0.096B      | 0.5\%       |
-|Project Lönnrot               | 0.078B      | 0.4\%       |
-|Finnish parliament speeches   | 0.021B      | 0.1\%       |
-|fi-news-corpus                | 0.004B      | 0.1\%       |
-|**TOTAL**                     | **18.884B** | **100.0\%** |
 ## Training procedure
@@ -180,7 +182,7 @@ This model was evaluated using [FIN-bench by TurkuNLP](https://github.com/TurkuN
 [llama-7b-finnish](https://huggingface.co/Finnish-NLP/llama-7b-finnish):
 |                      Task                      |Version|       Metric        |Value |   |Stderr|
-|------------------------------------------------|------:|---------------------|-----:|---|-----:|
 |bigbench_analogies                              |      0|multiple_choice_grade|0.2692|±  |0.0391|
 |bigbench_arithmetic_1_digit_addition            |      0|multiple_choice_grade|0.2600|±  |0.0441|
 |bigbench_arithmetic_1_digit_division            |      0|multiple_choice_grade|0.3043|±  |0.0981|
@@ -229,4 +231,4 @@ This project would not have been possible without compute generously provided by
 - Aapo Tanskanen, [Hugging Face profile](https://huggingface.co/aapot), [LinkedIn profile](https://www.linkedin.com/in/aapotanskanen/)
 - Rasmus Toivanen, [Hugging Face profile](https://huggingface.co/RASMUS), [LinkedIn profile](https://www.linkedin.com/in/rasmustoivanen/)
-Feel free to contact us for more details 🤗

 ---
 datasets:
 - Finnish-NLP/mc4_3.1.0_fi_cleaned
 - Finnish-NLP/oscar_2301_fi_cleaned
 - Finnish-NLP/Reddit_fi_2006_2022
 - Finnish-NLP/wikipedia_20230501_fi_cleaned
 - intfloat/multilingual_cc_news
+language:
+- fi
+license: apache-2.0
 pipeline_tag: text-generation
+library_name: transformers
+tags:
+- finnish
+- llama
+inference: false
 ---
+This is the Llama-3b for Finnish model described in the paper [Scaling Data-Constrained Language Models](https://huggingface.co/papers/2305.16264).
 # Llama-3b for Finnish
 Pretrained Llama model on Finnish language using a causal language modeling (CLM) objective. Llama model was introduced in
 |Dataset                       | Words       | Ratio       |
 |------------------------------|-------------|-------------|
+|mc4_3.1.0_fi_cleaned          | 11.462B     | 60.7%      |
+|oscar_2301_fi_cleaned         | 3.295B      | 17.4%      |
+|Suomi24                       | 3.045B      | 16.1%      |
+|multilingual_cc_news          | 0.295B      | 1.6%       |
+|STT                           | 0.249B      | 1.3%       |
+|Yle                           | 0.201B      | 1.1%       |
+|Reddit_fi_2006_2022           | 0.138B      | 0.7%       |
+|wikipedia_20230501_fi_cleaned | 0.096B      | 0.5%       |
+|Project Lönnrot               | 0.078B      | 0.4%       |
+|Finnish parliament speeches   | 0.021B      | 0.1%       |
+|fi-news-corpus                | 0.004B      | 0.1%       |
+|**TOTAL**                     | **18.884B** | **100.0%** |
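
The ratio column in the table above can be sanity-checked from the word counts. The following is a minimal sketch, not part of the model card: the names `word_counts_b`, `total_b`, and `ratios` are illustrative, and the counts are copied by hand from the table (in billions of words).

```python
# Word counts in billions, copied from the dataset table above.
# The variable names here are illustrative and do not appear in the README.
word_counts_b = {
    "mc4_3.1.0_fi_cleaned": 11.462,
    "oscar_2301_fi_cleaned": 3.295,
    "Suomi24": 3.045,
    "multilingual_cc_news": 0.295,
    "STT": 0.249,
    "Yle": 0.201,
    "Reddit_fi_2006_2022": 0.138,
    "wikipedia_20230501_fi_cleaned": 0.096,
    "Project Lönnrot": 0.078,
    "Finnish parliament speeches": 0.021,
    "fi-news-corpus": 0.004,
}

# Sum of all rows; this should reproduce the TOTAL row of the table.
total_b = sum(word_counts_b.values())

# Share of each dataset as a percentage of the total.
ratios = {name: 100.0 * count / total_b for name, count in word_counts_b.items()}

print(f"total: {total_b:.3f}B")
for name, ratio in ratios.items():
    print(f"{name:<30} {ratio:5.1f}%")
```

Under these assumptions the row sum reproduces the 18.884B total. Every ratio rounds to the listed value except `fi-news-corpus`, which works out to roughly 0.02% and so appears to be rounded up to 0.1% in the table.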
 ## Training procedure
 [llama-7b-finnish](https://huggingface.co/Finnish-NLP/llama-7b-finnish):
 |                      Task                      |Version|       Metric        |Value |   |Stderr|
+|------------------------------------------------|------:|---------------------|-----:|---|-----:|\
 |bigbench_analogies                              |      0|multiple_choice_grade|0.2692|±  |0.0391|
 |bigbench_arithmetic_1_digit_addition            |      0|multiple_choice_grade|0.2600|±  |0.0441|
 |bigbench_arithmetic_1_digit_division            |      0|multiple_choice_grade|0.3043|±  |0.0981|
 - Aapo Tanskanen, [Hugging Face profile](https://huggingface.co/aapot), [LinkedIn profile](https://www.linkedin.com/in/aapotanskanen/)
 - Rasmus Toivanen, [Hugging Face profile](https://huggingface.co/RASMUS), [LinkedIn profile](https://www.linkedin.com/in/rasmustoivanen/)
+Feel free to contact us for more details 🤗