CohereLabs/aya-101


alexrs committed on Apr 15

Commit 231cff3 (verified) · 1 Parent(s): 709e97e

Update README.md

Files changed (1)

README.md +13 -13

@@ -1,11 +1,11 @@
 ---
 license: apache-2.0
 datasets:
-  - CohereForAI/xP3x
-  - CohereForAI/aya_dataset
-  - CohereForAI/aya_collection
   - DataProvenanceInitiative/Commercially-Verified-Licenses
-  - CohereForAI/aya_evaluation_suite
 language:
   - afr
   - amh
@@ -121,19 +121,19 @@ metrics:
 > The Aya model is a massively multilingual generative language model that follows instructions in 101 languages.
 > Aya outperforms [mT0](https://huggingface.co/bigscience/mt0-xxl) and [BLOOMZ](https://huggingface.co/bigscience/bloomz) a wide variety of automatic and human evaluations despite covering double the number of languages.
-> The Aya model is trained using [xP3x](https://huggingface.co/datasets/CohereForAI/xP3x), [Aya Dataset](https://huggingface.co/datasets/CohereForAI/aya_dataset), [Aya Collection](https://huggingface.co/datasets/CohereForAI/aya_collection), a subset of [DataProvenance collection](https://huggingface.co/datasets/DataProvenanceInitiative/Commercially-Verified-Licenses) and ShareGPT-Command.
 > We release the checkpoints under a Apache-2.0 license to further our mission of multilingual technologies empowering a
 > multilingual world.
-- **Developed by:** [Cohere For AI](https://cohere.for.ai)
 - **Model type:** a Transformer style autoregressive massively multilingual language model.
 - **Paper**: [Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model](https://arxiv.org/abs/2402.07827)
-- **Point of Contact**: Cohere For AI: [cohere.for.ai](https://cohere.for.ai)
 - **Languages**: Refer to the list of languages in the `language` section of this model card.
 - **License**: Apache-2.0
-- **Model**: [Aya-101](https://huggingface.co/CohereForAI/aya-101)
 - **Model Size**: 13 billion parameters
-- **Datasets**: [xP3x](https://huggingface.co/datasets/CohereForAI/xP3x), [Aya Dataset](https://huggingface.co/datasets/CohereForAI/aya_dataset), [Aya Collection](https://huggingface.co/datasets/CohereForAI/aya_collection), [DataProvenance collection](https://huggingface.co/datasets/DataProvenanceInitiative/Commercially-Verified-Licenses), ShareGPT-Command.
 ## Use
@@ -141,7 +141,7 @@ metrics:
 # pip install -q transformers
 from transformers import AutoModelForSeq2SeqLM, AutoTokenizer
-checkpoint = "CohereForAI/aya-101"
 tokenizer = AutoTokenizer.from_pretrained(checkpoint)
 aya_model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)
@@ -174,9 +174,9 @@ print(tokenizer.decode(hin_outputs[0]))
 The Aya model is trained on the following datasets:
-- [xP3x](https://huggingface.co/datasets/CohereForAI/xP3x)
-- [Aya Dataset](https://huggingface.co/datasets/CohereForAI/aya_dataset)
-- [Aya Collection](https://huggingface.co/datasets/CohereForAI/aya_collection)
 - [DataProvenance collection](https://huggingface.co/datasets/DataProvenanceInitiative/Commercially-Verified-Licenses)
 - ShareGPT-Command

 ---
 license: apache-2.0
 datasets:
+  - CohereLabs/xP3x
+  - CohereLabs/aya_dataset
+  - CohereLabs/aya_collection
   - DataProvenanceInitiative/Commercially-Verified-Licenses
+  - CohereLabs/aya_evaluation_suite
 language:
   - afr
   - amh
 > The Aya model is a massively multilingual generative language model that follows instructions in 101 languages.
 > Aya outperforms [mT0](https://huggingface.co/bigscience/mt0-xxl) and [BLOOMZ](https://huggingface.co/bigscience/bloomz) a wide variety of automatic and human evaluations despite covering double the number of languages.
+> The Aya model is trained using [xP3x](https://huggingface.co/datasets/CohereLabs/xP3x), [Aya Dataset](https://huggingface.co/datasets/CohereLabs/aya_dataset), [Aya Collection](https://huggingface.co/datasets/CohereForAI/aya_collection), a subset of [DataProvenance collection](https://huggingface.co/datasets/DataProvenanceInitiative/Commercially-Verified-Licenses) and ShareGPT-Command.
 > We release the checkpoints under a Apache-2.0 license to further our mission of multilingual technologies empowering a
 > multilingual world.
+- **Developed by:** [Cohere Labs](https://cohere.for.ai)
 - **Model type:** a Transformer style autoregressive massively multilingual language model.
 - **Paper**: [Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model](https://arxiv.org/abs/2402.07827)
+- **Point of Contact**: [Cohere Labs](https://cohere.for.ai)
 - **Languages**: Refer to the list of languages in the `language` section of this model card.
 - **License**: Apache-2.0
+- **Model**: [Aya-101](https://huggingface.co/CohereLabs/aya-101)
 - **Model Size**: 13 billion parameters
+- **Datasets**: [xP3x](https://huggingface.co/datasets/CohereLabs/xP3x), [Aya Dataset](https://huggingface.co/datasets/CohereLabs/aya_dataset), [Aya Collection](https://huggingface.co/datasets/CohereLabs/aya_collection), [DataProvenance collection](https://huggingface.co/datasets/DataProvenanceInitiative/Commercially-Verified-Licenses), ShareGPT-Command.
 ## Use
 # pip install -q transformers
 from transformers import AutoModelForSeq2SeqLM, AutoTokenizer
+checkpoint = "CohereLabs/aya-101"
 tokenizer = AutoTokenizer.from_pretrained(checkpoint)
 aya_model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)
 The Aya model is trained on the following datasets:
+- [xP3x](https://huggingface.co/datasets/CohereLabs/xP3x)
+- [Aya Dataset](https://huggingface.co/datasets/CohereLabs/aya_dataset)
+- [Aya Collection](https://huggingface.co/datasets/CohereLabs/aya_collection)
 - [DataProvenance collection](https://huggingface.co/datasets/DataProvenanceInitiative/Commercially-Verified-Licenses)
 - ShareGPT-Command
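
The change above amounts to replacing the `CohereForAI/` organization prefix with `CohereLabs/` wherever it appears in repo IDs and Hub URLs. A minimal sketch of that substitution follows, assuming two hypothetical helpers, `rename_org` and `stale_lines`, that are not part of this commit or of any library. It is only an illustration of how such a rename could be applied and checked, not the method the author used.

```python
import re

# Organization names taken from the diff above.
OLD_ORG = "CohereForAI"
NEW_ORG = "CohereLabs"

# Match the old name only when used as a repo-ID prefix ("CohereForAI/...").
# Display text such as "Cohere For AI" and the cohere.for.ai URL are not matched.
_ORG_PREFIX = re.compile(rf"\b{re.escape(OLD_ORG)}/")


def rename_org(text: str) -> str:
    """Hypothetical helper: rewrite every old-org repo prefix to the new org."""
    return _ORG_PREFIX.sub(f"{NEW_ORG}/", text)


def stale_lines(text: str) -> list[int]:
    """Hypothetical helper: 1-based numbers of lines still using the old prefix."""
    return [
        number
        for number, line in enumerate(text.splitlines(), start=1)
        if _ORG_PREFIX.search(line)
    ]


if __name__ == "__main__":
    sample = 'checkpoint = "CohereForAI/aya-101"'
    print(rename_org(sample))
    print(stale_lines(sample))
```

Running `stale_lines` over the new version of a file is one way to spot links that a manual rename missed, such as a dataset URL that still carries the old prefix.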