mjwong
/

whisper-large-v3-singlish-DRAFT

@@ -72,24 +72,33 @@ The following hyperparameters are used:
 - **max_grad_norm**: 1.0
 - **generation_max_length**: 225
-### Benchmark Performance
-We evaluated the speculative decoding setup for Whisper large-v3-singlish on [SASRBench-v1](https://huggingface.co/datasets/mjwong/SASRBench-v1), a benchmark dataset for evaluating ASR performance on Singlish:
-#### Model Performance
-| **Model**                                                                                          | **Rel. RTFx** | **WER**   |
-|----------------------------------------------------------------------------------------------------|---------------|-----------|
-| [Whisper-large-v3-singlish](https://huggingface.co/mjwong/whisper-large-v3-singlish)               | 1.00          | 16.41%    |
-| [Whisper-large-v3-turbo-singlish](https://huggingface.co/mjwong/whisper-large-v3-turbo-singlish)   | 2.36          | 13.35%    |
-| Whisper-large-v3-singlish + [DRAFT](https://huggingface.co/mjwong/whisper-large-v3-singlish-DRAFT) | 2.20          | 14.84%    |
-#### Speculative Acceptance Rates
-| **Speculative Setup**                                                                                 | **Micro Avg Acceptance**  | **Macro Avg Acceptance** |
-|-------------------------------------------------------------------------------------------------------|---------------------------|--------------------------|
-| Whisper-large-v3-singlish + [DRAFT](https://huggingface.co/mjwong/whisper-large-v3-singlish-DRAFT)    | 38.00%                    | 42.00%                   |
 ## Disclaimer

 - **max_grad_norm**: 1.0
 - **generation_max_length**: 225
+## Benchmark Performance
+We evaluated the speculative decoding setup for Whisper-large-v3-singlish on the following datasets:
+- [SASRBench-v1](https://huggingface.co/datasets/mjwong/SASRBench-v1): A benchmark dataset for evaluating ASR performance on Singlish.
+- [AMI](https://huggingface.co/datasets/edinburghcstr/ami): A widely used dataset for meeting transcription and diarization tasks.
+### Model Performance
+| **Dataset**     | **Model Variant**         | **Link**                                                                                         | **Rel. RTFx** | **WER**    |
+|-----------------|---------------------------|--------------------------------------------------------------------------------------------------|---------------|------------|
+| SASRBench-v1    | Large                     | [Whisper-large-v3-singlish](https://huggingface.co/mjwong/whisper-large-v3-singlish)             | 1.00          | 16.41%     |
+| SASRBench-v1    | Large-Turbo               | [Whisper-large-v3-turbo-singlish](https://huggingface.co/mjwong/whisper-large-v3-turbo-singlish) | **2.36**      | **13.35%** |
+| SASRBench-v1    | Draft-enhanced Large      | Whisper-large-v3-singlish + [DRAFT](https://huggingface.co/mjwong/whisper-large-v3-singlish-DRAFT)                           | 2.20          | 14.84%     |
+||||||
+| AMI             | Large                     | [Whisper-large-v3-singlish](https://huggingface.co/mjwong/whisper-large-v3-singlish)             | 1.00          | 23.72%     |
+| AMI             | Large-Turbo               | [Whisper-large-v3-turbo-singlish](https://huggingface.co/mjwong/whisper-large-v3-turbo-singlish) | 1.53          | **16.99%** |
+| AMI             | Draft-enhanced Large      | Whisper-large-v3-singlish + [DRAFT](https://huggingface.co/mjwong/whisper-large-v3-singlish-DRAFT)                           | **2.27**      | 22.06%     |
+### Speculative Acceptance Rates (DRAFT-enhanced Large Model)
+| **Dataset**    | **Micro Avg Acceptance** | **Macro Avg Acceptance** |
+|----------------|--------------------------|---------------------------|
+| SASRBench-v1   | 38.00%                   | 42.00%                    |
+| AMI            | 38.00%                   | 43.00%                    |
 ## Disclaimer