DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERT Paper • 2110.01900 • Published Oct 5, 2021
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model Paper • 2210.00705 • Published Oct 3, 2022
USAD: Universal Speech and Audio Representation via Distillation Paper • 2506.18843 • Published Jun 23 • 11
OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning Paper • 2506.00338 • Published May 31 • 10