Update README.md
README.md
CHANGED
@@ -15,7 +15,6 @@ If you’re looking for all datasets, models, or work from our broader team, vis
 ## About the South African Next Voices Project
 
 **ZA-ANV** is building a **3,000-hour** multilingual, multi-domain speech dataset for South Africa, spanning seven local languages.
-
 - **Languages:** Setswana, isiZulu, isiXhosa, Sesotho, Sepedi, isiNdebele, Tshivenda
 - **Coverage:** 500 hours per language for the main five; 250 hours for isiNdebele and Tshivenda (pilot/experimental scale for future work)
 - **Domains:** Broad/general domains to reflect real-world diversity
@@ -27,8 +26,7 @@ If you’re looking for all datasets, models, or work from our broader team, vis
 We work at the intersection of **Data Science for Society** and **Local Language NLP**.
 
 Our mission:
-
-
+Data-driven collaborative innovation to empower society to tackle challenges and preserve our languages.
 Find all our work and resources at: [huggingface.co/dsfsi](https://huggingface.co/dsfsi)
 
 **Questions?**