---
license: cc
datasets:
- farabi-lab/kazakh-stt
language:
- kk
base_model:
- openai/whisper-small
tags:
- asr
- kazakh
---
# AitASR
**AitASR** is a fine-tuned version of OpenAI's Whisper Small model for Automatic Speech Recognition (ASR) in the Kazakh language. It was trained on the [`farabi-lab/kazakh-stt`](https://huggingface.co/datasets/farabi-lab/kazakh-stt) dataset to improve transcription quality for Kazakh audio.

---

## 🔧 Intended Use
The model is designed for ASR tasks involving Kazakh-language audio.  
It is suitable for:
- Transcription of Kazakh speech
- Voice command recognition
- Speech-driven applications in Kazakh
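
As a rough illustration of the transcription use case, the sketch below loads the model through the Hugging Face `transformers` ASR pipeline. It is a minimal example under stated assumptions, not an official usage recipe: the repository id `your-username/AitASR` is a hypothetical placeholder (this card does not state the Hub path), the helper names are illustrative, and the 30-second chunk length is simply Whisper's native window.

```python
import sys

# Hypothetical placeholder: replace with the actual Hub repository id of this model.
MODEL_ID = "your-username/AitASR"


def build_generate_kwargs(language="kazakh", task="transcribe"):
    """Return decoding options that pin Whisper to Kazakh transcription."""
    return {"language": language, "task": task}


def transcribe(audio_path, model_id=MODEL_ID):
    """Transcribe one audio file and return the recognized text."""
    # Imported lazily so the helpers above can be used without transformers installed.
    from transformers import pipeline

    asr = pipeline(
        "automatic-speech-recognition",
        model=model_id,
        chunk_length_s=30,  # split long recordings into 30-second windows
    )
    result = asr(audio_path, generate_kwargs=build_generate_kwargs())
    return result["text"]


if __name__ == "__main__":
    # Usage: python transcribe.py path/to/audio.wav
    if len(sys.argv) > 1:
        print(transcribe(sys.argv[1]))
```

Pinning `language` and `task` avoids relying on Whisper's automatic language detection, which can misidentify short or noisy clips.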

---

## ⚠️ Limitations
- May perform poorly on:
  - Low-quality or noisy audio
  - Audio from domains significantly different from the training data
- Not optimized for real-time use without further engineering

---

## Citation
If you use this model, please cite it as follows:

```bibtex
@article{kadyrbek2023ksd,
  author = {Kadyrbek, N. and Mansurova, M. and Shomanov, A. and Makharova, G.},
  title = {The Development of a Kazakh Speech Recognition Model Using a Convolutional Neural Network with Fixed Character Level Filters},
  journal = {Big Data and Cognitive Computing},
  year = {2023},
  volume = {7},
  number = {3},
  pages = {132},
  doi = {10.3390/bdcc7030132}
}
```

---

## Commercial Use
For commercial use, please contact the author directly to discuss licensing terms and permissions.