pr0mila-gh0sh commited on
Commit
82a4eee
·
verified ·
1 Parent(s): d24c3e1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -42,6 +42,8 @@ tags:
42
  ## Model Description
43
  **Medibeng Whisper Tiny** is a fine-tuned version of the Whisper model for **automatic speech recognition (ASR)**, specifically designed to **transcribe and translate code-switched Bengali-English** conversations into **English**. This model is designed for **clinical settings** and can handle audio that contains a mix of Bengali and English, making it suitable for transcription and **translation** tasks in multilingual environments, such as medical and healthcare settings.
44
 
 
 
45
  ## Usage
46
  To use the **Medibeng Whisper Tiny** model for translating code-switched Bengali-English conversations into English, follow this example:
47
 
@@ -109,7 +111,7 @@ The model was fine-tuned on the **MediBeng** dataset, which consists of **code-s
109
 
110
  - **Dataset Size**: 20% of the **MediBeng** dataset was used for fine-tuning. The dataset is available on [Hugging Face](https://huggingface.co/datasets/pr0mila-gh0sh/MediBeng).
111
  - **Data Source**: [MediBeng dataset](https://huggingface.co/datasets/pr0mila-gh0sh/MediBeng)
112
- - **Data Process Source**: [ParquetToHuggingFace](https://github.com/pr0mila/ParquetToHuggingFace)
113
  - **Data Characteristics**: The dataset contains conversational speech with both **Bengali** and **English**, with specific focus on medical terminologies and clinical dialogues.
114
 
115
  ## Evaluation Results
 
42
  ## Model Description
43
  **Medibeng Whisper Tiny** is a fine-tuned version of the Whisper model for **automatic speech recognition (ASR)**, specifically designed to **transcribe and translate code-switched Bengali-English** conversations into **English**. This model is designed for **clinical settings** and can handle audio that contains a mix of Bengali and English, making it suitable for transcription and **translation** tasks in multilingual environments, such as medical and healthcare settings.
44
 
45
+ **Repository:** https://github.com/pr0mila/MediBeng-Whisper-Tiny
46
+
47
  ## Usage
48
  To use the **Medibeng Whisper Tiny** model for translating code-switched Bengali-English conversations into English, follow this example:
49
 
 
111
 
112
  - **Dataset Size**: 20% of the **MediBeng** dataset was used for fine-tuning. The dataset is available on [Hugging Face](https://huggingface.co/datasets/pr0mila-gh0sh/MediBeng).
113
  - **Data Source**: [MediBeng dataset](https://huggingface.co/datasets/pr0mila-gh0sh/MediBeng)
114
+ - **Data Process Source**: https://github.com/pr0mila/ParquetToHuggingFace
115
  - **Data Characteristics**: The dataset contains conversational speech with both **Bengali** and **English**, with specific focus on medical terminologies and clinical dialogues.
116
 
117
  ## Evaluation Results