--- license: cc datasets: - farabi-lab/kazakh-stt language: - kk base_model: - openai/whisper-small tags: - asr - kazakh --- # AitASR **AitASR** is a fine-tuned version of OpenAI's Whisper Small model for Automatic Speech Recognition (ASR) in the Kazakh language. It was trained on the [`farabi-lab/kazakh-stt`](https://huggingface.co/datasets/farabi-lab/kazakh-stt) dataset to improve transcription quality for Kazakh audio. --- ## 🔧 Intended Use The model is designed for ASR tasks involving Kazakh-language audio. It is suitable for: - Transcription of Kazakh speech - Voice command recognition - Speech-driven applications in Kazakh --- ## ⚠️ Limitations - May perform poorly on: - Low-quality or noisy audio - Audio from domains significantly different from the training data - Not optimized for real-time use without further engineering ## 5. Citation If you use this model, please cite it as follows: ```bibtex @article{kadyrbek2023ksd, author = {Kadyrbek, N.; Mansurova, M.; Shomanov, A.; Makharova, G.}, title = {The Development of a Kazakh Speech Recognition Model Using a Convolutional Neural Network with Fixed Character Level Filters}, journal = {Big Data and Cognitive Computing}, year = {2023}, volume = {7}, number = {3}, pages = {132}, doi = {https://doi.org/10.3390/bdcc7030132} }``` --- Commercial Use For commercial use, please contact the author directly to discuss licensing terms and permissions.