avemio-digital commited on
Commit
0d60cfa
·
verified ·
1 Parent(s): 13f71ff

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -5
README.md CHANGED
@@ -12,21 +12,21 @@ pipeline_tag: automatic-speech-recognition
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
  should probably proofread and complete it, then remove this comment. -->
14
 
15
- # GRAG-WHISPER-LARGE-v3-TURBO-HESSIAN-AI
16
 
17
  This model is fine-tuned on a carefully curated 13 hour dataset.
18
 
19
 
20
  ## Evaluations - Word error rate
21
 
22
- | Test-Dataset | openai-whisper-large-v3-turbo | **GRAG-WHISPER-LARGE-v3-TURBO** | primeline-whisper-large-v3-turbo-german |
23
  |-------------------------------------|-------------------------------|-------------------------|-----------------------------------|
24
  | Tuda-De | 8.195 | **6.360** | 6.441 |
25
  | common_voice_19_0 | 3.839 | 3.249 | **3.217** |
26
  | multilingual librispeech | 3.202 | 2.071 | **2.067** |
27
  | All | 3.641 | 2.633 | **2.630** |
28
 
29
- The data and code for evaluations are available [here](https://huggingface.co/datasets/avemio/ASR-GERMAN-MIXED-EVALS-GRAG)
30
 
31
  ### Training data
32
  The training data for this model includes conversations of spoken German with a mix of english business phrases included. The data was carefully selected and processed to optimize recognition performance. The dataset will not be published because of unclear situation if the data would be used for voice-cloning. The rights to use the collected data are only for the intended use to train speech-to-text models.
@@ -39,7 +39,7 @@ from transformers import AutoModelForSpeechSeq2Seq, AutoProcessor, pipeline
39
  from datasets import load_dataset
40
  device = "cuda:0" if torch.cuda.is_available() else "cpu"
41
  torch_dtype = torch.float16 if torch.cuda.is_available() else torch.float32
42
- model_id = "avemio/GRAG-WHISPER-LARGE-v3-TURBO"
43
  model = AutoModelForSpeechSeq2Seq.from_pretrained(
44
  model_id, torch_dtype=torch_dtype, low_cpu_mem_usage=True, use_safetensors=True
45
  )
@@ -69,4 +69,11 @@ print(result["text"])
69
  - Transformers 4.47.1
70
  - Pytorch 2.5.1+cu121
71
  - Datasets 3.2.0
72
- - Tokenizers 0.21.0
 
 
 
 
 
 
 
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
  should probably proofread and complete it, then remove this comment. -->
14
 
15
+ # German-RAG-WHISPER-LARGE-v3-TURBO-HESSIAN-AI
16
 
17
  This model is fine-tuned on a carefully curated 13 hour dataset.
18
 
19
 
20
  ## Evaluations - Word error rate
21
 
22
+ | Test-Dataset | openai-whisper-large-v3-turbo | **German-RAG-WHISPER-LARGE-v3-TURBO** | primeline-whisper-large-v3-turbo-german |
23
  |-------------------------------------|-------------------------------|-------------------------|-----------------------------------|
24
  | Tuda-De | 8.195 | **6.360** | 6.441 |
25
  | common_voice_19_0 | 3.839 | 3.249 | **3.217** |
26
  | multilingual librispeech | 3.202 | 2.071 | **2.067** |
27
  | All | 3.641 | 2.633 | **2.630** |
28
 
29
+ The data and code for evaluations are available [here](https://huggingface.co/datasets/avemio/ASR-GERMAN-MIXED-EVALS-German-RAG)
30
 
31
  ### Training data
32
  The training data for this model includes conversations of spoken German with a mix of english business phrases included. The data was carefully selected and processed to optimize recognition performance. The dataset will not be published because of unclear situation if the data would be used for voice-cloning. The rights to use the collected data are only for the intended use to train speech-to-text models.
 
39
  from datasets import load_dataset
40
  device = "cuda:0" if torch.cuda.is_available() else "cpu"
41
  torch_dtype = torch.float16 if torch.cuda.is_available() else torch.float32
42
+ model_id = "avemio/German-RAG-WHISPER-LARGE-v3-TURBO"
43
  model = AutoModelForSpeechSeq2Seq.from_pretrained(
44
  model_id, torch_dtype=torch_dtype, low_cpu_mem_usage=True, use_safetensors=True
45
  )
 
69
  - Transformers 4.47.1
70
  - Pytorch 2.5.1+cu121
71
  - Datasets 3.2.0
72
+ - Tokenizers 0.21.0
73
+
74
+
75
+ ## The German-RAG AI Team
76
+ [Marcel Rosiak](https://de.linkedin.com/in/marcel-rosiak)
77
+ [Soumya Paul](https://de.linkedin.com/in/soumya-paul-1636a68a)
78
+ [Siavash Mollaebrahim](https://de.linkedin.com/in/siavash-mollaebrahim-4084b5153?trk=people-guest_people_search-card)
79
+ [Zain ul Haq](https://de.linkedin.com/in/zain-ul-haq-31ba35196)