Safetensors
English
qwen2

Model Performance Overview

Metrics:

  • CER: Character Error Rate (lower = better).
  • WER: Word Error Rate (lower = better).
Model CER WER
SALT-asr 8.42 18.49

Our Solution

  • Method: Extends a pre-trained LLM with audio tokens and fine-tunes on ASR task.
  • Audio tokenization: SpeechTokenizer (semantic tokens only).

Resources


Downloads last month
2
Safetensors
Model size
495M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Vikhrmodels/salt-qwen2.5-0.5b-asr

Base model

Qwen/Qwen2.5-0.5B
Finetuned
(353)
this model

Datasets used to train Vikhrmodels/salt-qwen2.5-0.5b-asr

Collection including Vikhrmodels/salt-qwen2.5-0.5b-asr