mitchelldehaven
/

whisper-large-v2-ru

Automatic Speech Recognition

Inference Endpoints

Model card Files Files and versions Community

whisper-large-v2-ru / README.md

mitchelldehaven's picture

mitchelldehaven

fix for leaderboard (#1)

1f8d15a almost 2 years ago

|

history blame contribute delete

724 Bytes

	---
	model-index:
	- name: whisper-large-v2-ru
	results:
	- task:
	type: automatic-speech-recognition
	name: Automatic Speech Recognition
	dataset:
	name: mozilla-foundation/common_voice_11_0
	type: mozilla-foundation/common_voice_11_0
	config: ru
	split: test
	args: ru
	metrics:
	- type: wer
	value: 7.73
	name: WER
	tags:
	- whisper-event
	---

	Whisper model finetuned using audio data from Open STT Russian Dataset (https://github.com/snakers4/open_stt).

	There is a differences in tokenization of source data (in our data normalization process, we replace punctucation with "" rather than Whisper's " "). This mismatch leads to a slight degradation on CommonVoice.