Update README.md
Browse files
README.md
CHANGED
@@ -42,6 +42,8 @@ tags:
|
|
42 |
## Model Description
|
43 |
**Medibeng Whisper Tiny** is a fine-tuned version of the Whisper model for **automatic speech recognition (ASR)**, specifically designed to **transcribe and translate code-switched Bengali-English** conversations into **English**. This model is designed for **clinical settings** and can handle audio that contains a mix of Bengali and English, making it suitable for transcription and **translation** tasks in multilingual environments, such as medical and healthcare settings.
|
44 |
|
|
|
|
|
45 |
## Usage
|
46 |
To use the **Medibeng Whisper Tiny** model for translating code-switched Bengali-English conversations into English, follow this example:
|
47 |
|
@@ -109,7 +111,7 @@ The model was fine-tuned on the **MediBeng** dataset, which consists of **code-s
|
|
109 |
|
110 |
- **Dataset Size**: 20% of the **MediBeng** dataset was used for fine-tuning. The dataset is available on [Hugging Face](https://huggingface.co/datasets/pr0mila-gh0sh/MediBeng).
|
111 |
- **Data Source**: [MediBeng dataset](https://huggingface.co/datasets/pr0mila-gh0sh/MediBeng)
|
112 |
-
- **Data Process Source**:
|
113 |
- **Data Characteristics**: The dataset contains conversational speech with both **Bengali** and **English**, with specific focus on medical terminologies and clinical dialogues.
|
114 |
|
115 |
## Evaluation Results
|
|
|
42 |
## Model Description
|
43 |
**Medibeng Whisper Tiny** is a fine-tuned version of the Whisper model for **automatic speech recognition (ASR)**, specifically designed to **transcribe and translate code-switched Bengali-English** conversations into **English**. This model is designed for **clinical settings** and can handle audio that contains a mix of Bengali and English, making it suitable for transcription and **translation** tasks in multilingual environments, such as medical and healthcare settings.
|
44 |
|
45 |
+
**Repository:** https://github.com/pr0mila/MediBeng-Whisper-Tiny
|
46 |
+
|
47 |
## Usage
|
48 |
To use the **Medibeng Whisper Tiny** model for translating code-switched Bengali-English conversations into English, follow this example:
|
49 |
|
|
|
111 |
|
112 |
- **Dataset Size**: 20% of the **MediBeng** dataset was used for fine-tuning. The dataset is available on [Hugging Face](https://huggingface.co/datasets/pr0mila-gh0sh/MediBeng).
|
113 |
- **Data Source**: [MediBeng dataset](https://huggingface.co/datasets/pr0mila-gh0sh/MediBeng)
|
114 |
+
- **Data Process Source**: https://github.com/pr0mila/ParquetToHuggingFace
|
115 |
- **Data Characteristics**: The dataset contains conversational speech with both **Bengali** and **English**, with specific focus on medical terminologies and clinical dialogues.
|
116 |
|
117 |
## Evaluation Results
|