piotrzelasko commited on
Commit
e891611
·
verified ·
1 Parent(s): 770a734

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +172 -2
README.md CHANGED
@@ -1,3 +1,174 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  # Model Overview
2
 
3
  ## Description:
@@ -248,5 +419,4 @@ As outlined in the paper "Towards Measuring Fairness in AI: the Casual Conversat
248
 
249
  ## Ethical Considerations:
250
  NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.
251
- For more detailed information on ethical considerations for this model, please see the Model Card++ Explainability, Bias, Safety & Security, and Privacy Subcards. Please report security vulnerabilities or NVIDIA AI Concerns [here](https://www.nvidia.com/en-us/support/submit-security-vulnerability/).
252
-
 
1
+ ---
2
+ license: cc-by-4.0
3
+ language:
4
+ - en
5
+ library_name: nemo
6
+ datasets:
7
+ - Granary
8
+ - YTC
9
+ - Yodas2
10
+ - LibriLight
11
+ - librispeech_asr
12
+ - fisher_corpus
13
+ - Switchboard-1
14
+ - WSJ-0
15
+ - WSJ-1
16
+ - National-Singapore-Corpus-Part-1
17
+ - National-Singapore-Corpus-Part-6
18
+ - vctk
19
+ - voxpopuli
20
+ - europarl
21
+ - multilingual_librispeech
22
+ - mozilla-foundation/common_voice_8_0
23
+ - MLCommons/peoples_speech
24
+ thumbnail: null
25
+ tags:
26
+ - automatic-speech-recognition
27
+ - speech
28
+ - audio
29
+ - Transformer
30
+ - FastConformer
31
+ - Conformer
32
+ - pytorch
33
+ - NeMo
34
+ - Qwen
35
+ - hf-asr-leaderboard
36
+ widget:
37
+ - example_title: Librispeech sample 1
38
+ src: https://cdn-media.huggingface.co/speech_samples/sample1.flac
39
+ - example_title: Librispeech sample 2
40
+ src: https://cdn-media.huggingface.co/speech_samples/sample2.flac
41
+ model-index:
42
+ - name: canary-qwen-2.5b
43
+ results:
44
+ - task:
45
+ name: Automatic Speech Recognition
46
+ type: automatic-speech-recognition
47
+ dataset:
48
+ name: AMI (Meetings test)
49
+ type: edinburghcstr/ami
50
+ config: ihm
51
+ split: test
52
+ args:
53
+ language: en
54
+ metrics:
55
+ - name: Test WER
56
+ type: wer
57
+ value: 10.18
58
+ - task:
59
+ name: Automatic Speech Recognition
60
+ type: automatic-speech-recognition
61
+ dataset:
62
+ name: Earnings-22
63
+ type: revdotcom/earnings22
64
+ split: test
65
+ args:
66
+ language: en
67
+ metrics:
68
+ - name: Test WER
69
+ type: wer
70
+ value: 10.42
71
+ - task:
72
+ name: Automatic Speech Recognition
73
+ type: automatic-speech-recognition
74
+ dataset:
75
+ name: GigaSpeech
76
+ type: speechcolab/gigaspeech
77
+ split: test
78
+ args:
79
+ language: en
80
+ metrics:
81
+ - name: Test WER
82
+ type: wer
83
+ value: 9.41
84
+ - task:
85
+ name: Automatic Speech Recognition
86
+ type: automatic-speech-recognition
87
+ dataset:
88
+ name: LibriSpeech (clean)
89
+ type: librispeech_asr
90
+ config: other
91
+ split: test
92
+ args:
93
+ language: en
94
+ metrics:
95
+ - name: Test WER
96
+ type: wer
97
+ value: 1.6
98
+ - task:
99
+ name: Automatic Speech Recognition
100
+ type: automatic-speech-recognition
101
+ dataset:
102
+ name: LibriSpeech (other)
103
+ type: librispeech_asr
104
+ config: other
105
+ split: test
106
+ args:
107
+ language: en
108
+ metrics:
109
+ - name: Test WER
110
+ type: wer
111
+ value: 3.1
112
+ - task:
113
+ type: Automatic Speech Recognition
114
+ name: automatic-speech-recognition
115
+ dataset:
116
+ name: SPGI Speech
117
+ type: kensho/spgispeech
118
+ config: test
119
+ split: test
120
+ args:
121
+ language: en
122
+ metrics:
123
+ - name: Test WER
124
+ type: wer
125
+ value: 1.9
126
+ - task:
127
+ type: Automatic Speech Recognition
128
+ name: automatic-speech-recognition
129
+ dataset:
130
+ name: tedlium-v3
131
+ type: LIUM/tedlium
132
+ config: release1
133
+ split: test
134
+ args:
135
+ language: en
136
+ metrics:
137
+ - name: Test WER
138
+ type: wer
139
+ value: 2.72
140
+ - task:
141
+ name: Automatic Speech Recognition
142
+ type: automatic-speech-recognition
143
+ dataset:
144
+ name: Vox Populi
145
+ type: facebook/voxpopuli
146
+ config: en
147
+ split: test
148
+ args:
149
+ language: en
150
+ metrics:
151
+ - name: Test WER
152
+ type: wer
153
+ value: 5.66
154
+ metrics:
155
+ - wer
156
+ base_model:
157
+ - nvidia/canary-1b-flash
158
+ - Qwen/Qwen3-1.7B
159
+ ---
160
+
161
+ <style>
162
+ img {
163
+ display: inline;
164
+ }
165
+ </style>
166
+
167
+ [![Model architecture](https://img.shields.io/badge/Model_Arch-SALM-blue#model-badge)](#model-architecture)
168
+ | [![Model size](https://img.shields.io/badge/Params-2.5B-green#model-badge)](#model-architecture)
169
+ | [![Language](https://img.shields.io/badge/Language-en-orange#model-badge)](#datasets)
170
+
171
+
172
  # Model Overview
173
 
174
  ## Description:
 
419
 
420
  ## Ethical Considerations:
421
  NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.
422
+ For more detailed information on ethical considerations for this model, please see the Model Card++ Explainability, Bias, Safety & Security, and Privacy Subcards. Please report security vulnerabilities or NVIDIA AI Concerns [here](https://www.nvidia.com/en-us/support/submit-security-vulnerability/).