Update README.md
README.md
@@ -72,24 +72,33 @@ The following hyperparameters are used:
 - **max_grad_norm**: 1.0
 - **generation_max_length**: 225
 
-
+## Benchmark Performance
 
-We evaluated the speculative decoding setup for Whisper large-v3-singlish on [SASRBench-v1](https://huggingface.co/datasets/mjwong/SASRBench-v1), a benchmark dataset for evaluating ASR performance on Singlish:
 
-
+We evaluated the speculative decoding setup for Whisper-large-v3-singlish on the following datasets:
 
-
-|----------------------------------------------------------------------------------------------------|---------------|-----------|
-| [Whisper-large-v3-singlish](https://huggingface.co/mjwong/whisper-large-v3-singlish) | 1.00 | 16.41% |
-| [Whisper-large-v3-turbo-singlish](https://huggingface.co/mjwong/whisper-large-v3-turbo-singlish) | 2.36 | 13.35% |
-| Whisper-large-v3-singlish + [DRAFT](https://huggingface.co/mjwong/whisper-large-v3-singlish-DRAFT) | 2.20 | 14.84% |
+- [SASRBench-v1](https://huggingface.co/datasets/mjwong/SASRBench-v1): A benchmark dataset for evaluating ASR performance on Singlish.
 
+- [AMI](https://huggingface.co/datasets/edinburghcstr/ami): A widely used dataset for meeting transcription and diarization tasks.
 
-
+### Model Performance
 
-| **
-
-| Whisper-large-v3-singlish
+| **Dataset** | **Model Variant** | **Link** | **Rel. RTFx** | **WER** |
+|-----------------|---------------------------|--------------------------------------------------------------------------------------------------|---------------|------------|
+| SASRBench-v1 | Large | [Whisper-large-v3-singlish](https://huggingface.co/mjwong/whisper-large-v3-singlish) | 1.00 | 16.41% |
+| SASRBench-v1 | Large-Turbo | [Whisper-large-v3-turbo-singlish](https://huggingface.co/mjwong/whisper-large-v3-turbo-singlish) | **2.36** | **13.35%** |
+| SASRBench-v1 | Draft-enhanced Large | Whisper-large-v3-singlish + [DRAFT](https://huggingface.co/mjwong/whisper-large-v3-singlish-DRAFT) | 2.20 | 14.84% |
+||||||
+| AMI | Large | [Whisper-large-v3-singlish](https://huggingface.co/mjwong/whisper-large-v3-singlish) | 1.00 | 23.72% |
+| AMI | Large-Turbo | [Whisper-large-v3-turbo-singlish](https://huggingface.co/mjwong/whisper-large-v3-turbo-singlish) | 1.53 | **16.99%** |
+| AMI | Draft-enhanced Large | Whisper-large-v3-singlish + [DRAFT](https://huggingface.co/mjwong/whisper-large-v3-singlish-DRAFT) | **2.27** | 22.06% |
+
+### Speculative Acceptance Rates (DRAFT-enhanced Large Model)
+
+| **Dataset** | **Micro Avg Acceptance** | **Macro Avg Acceptance** |
+|----------------|--------------------------|---------------------------|
+| SASRBench-v1 | 38.00% | 42.00% |
+| AMI | 38.00% | 43.00% |
 
 ## Disclaimer
 
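The added acceptance-rate table reports both a micro and a macro average, but the README does not say how either was computed. The sketch below shows the conventional reading, under the assumption that micro means total accepted draft tokens divided by total proposed draft tokens across the dataset, and macro means the unweighted mean of per-utterance rates. The function name `acceptance_rates` and the `(accepted, proposed)` input layout are hypothetical and not taken from the repository.

```python
from typing import List, Tuple


def acceptance_rates(per_utterance: List[Tuple[int, int]]) -> Tuple[float, float]:
    """Return (micro, macro) draft-token acceptance rates.

    Each item is (accepted_draft_tokens, proposed_draft_tokens) for one
    utterance. This input layout is an assumption made for illustration.
    """
    # Utterances with no proposed draft tokens have no defined rate; skip them.
    scored = [(a, p) for a, p in per_utterance if p > 0]
    if not scored:
        raise ValueError("need at least one utterance with proposed draft tokens")

    # Micro: pool all tokens first, so long utterances carry more weight.
    micro = sum(a for a, _ in scored) / sum(p for _, p in scored)

    # Macro: average the per-utterance rates, so every utterance counts equally.
    macro = sum(a / p for a, p in scored) / len(scored)
    return micro, macro


if __name__ == "__main__":
    # Toy numbers, not benchmark data: one long and one short utterance.
    micro, macro = acceptance_rates([(30, 100), (5, 10)])
    print(f"micro={micro:.2f} macro={macro:.2f}")  # micro=0.32 macro=0.40
```

Under this reading, a macro average above the micro average (as in both rows of the table) would suggest that shorter utterances tend to have higher acceptance rates than longer ones.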