Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Vikhrmodels
/
salt-qwen2.5-0.5b-asr
like
0
Follow
Vikhr models
427
Safetensors
openslr/librispeech_asr
amphion/Emilia-Dataset
English
qwen2
Model card
Files
Files and versions
xet
Community
Model Performance Overview
Our Solution
Resources
Model Performance Overview
Metrics
:
CER
: Character Error Rate (lower = better).
WER
: Word Error Rate (lower = better).
Model
CER
WER
SALT-asr
8.42
18.49
Our Solution
Method
: Extends a pre-trained LLM with audio tokens and fine-tunes on
ASR
task.
Audio tokenization
: SpeechTokenizer (semantic tokens only).
Resources
Code:
GitHub Repo
Downloads last month
2
Safetensors
Model size
495M params
Tensor type
F32
·
Chat template
Files info
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for
Vikhrmodels/salt-qwen2.5-0.5b-asr
Base model
Qwen/Qwen2.5-0.5B
Finetuned
(
353
)
this model
Datasets used to train
Vikhrmodels/salt-qwen2.5-0.5b-asr
amphion/Emilia-Dataset
Viewer
•
Updated
Feb 28
•
54.8M
•
62.3k
•
343
openslr/librispeech_asr
Updated
Aug 14, 2024
•
14.4k
•
156
Collection including
Vikhrmodels/salt-qwen2.5-0.5b-asr
SALT
Collection
3 items
•
Updated
10 days ago