ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet Paper • 2111.14706 • Published Nov 29, 2021
On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models Paper • 2406.09282 • Published Jun 13, 2024
OWLS: Scaling Laws for Multilingual Speech Recognition and Translation Models Paper • 2502.10373 • Published Feb 14
Granary: Speech Recognition and Translation Dataset in 25 European Languages Paper • 2505.13404 • Published 18 days ago
OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning Paper • 2506.00338 • Published 7 days ago • 8
Open Whisper-style Speech Models (OWSM) Collection Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/ • 21 items • Updated 4 days ago • 5
OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning Paper • 2506.00338 • Published 7 days ago • 8 • 2
OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning Paper • 2506.00338 • Published 7 days ago • 8
Open Whisper-style Speech Models (OWSM) Collection Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/ • 21 items • Updated 4 days ago • 5