Typhoon ASR Real-time: FastConformer-Transducer for Thai Automatic Speech Recognition • Paper • 2601.13044 • Published 21 days ago • 12
GigaSpeech Series • Collection • Evolving, Large-Scale, and Multi-domain ASR Corpus • 4 items • Updated 20 days ago
k2SSL • Collection • A Faster and Better Framework for Self-Supervised Speech Representation Learning • 5 items • Updated 20 days ago
SLAM-LLM: A Modular, Open-Source Multimodal Large Language Model Framework and Best Practice for Speech, Language, Audio and Music Processing • Paper • 2601.09385 • Published 26 days ago
CLSP • Collection • Open-Ended Speaking Style Modeling via Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training • 4 items • Updated 2 days ago