Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/
Yifan Peng
pyf98
AI & ML interests
Speech Processing, Speech Recognition, Spoken Language Processing
Recent Activity
authored
a paper
3 days ago
ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet
authored
a paper
3 days ago
On the Effects of Heterogeneous Data Sources on Speech-to-Text
Foundation Models
authored
a paper
3 days ago
OWLS: Scaling Laws for Multilingual Speech Recognition and Translation
Models
Organizations
Collections
1
spaces
1
models
48

pyf98/DPHuBERT
Updated
•
4

pyf98/fisher_callhome_spanish_e_branchformer
Automatic Speech Recognition
•
Updated
•
5

pyf98/fisher_callhome_spanish_conformer
Automatic Speech Recognition
•
Updated
•
3

pyf98/slurp_entity_e_branchformer
Automatic Speech Recognition
•
Updated
•
4

pyf98/aidatatang_200zh_e_branchformer_e16
Automatic Speech Recognition
•
Updated
•
4

pyf98/librispeech_100_transducer_e_branchformer
Automatic Speech Recognition
•
Updated
•
3

pyf98/librispeech_100_transducer_conformer
Automatic Speech Recognition
•
Updated
•
2
•
1

pyf98/jsut_e_branchformer
Automatic Speech Recognition
•
Updated
•
6

pyf98/aishell_ctc_e_branchformer_e12
Automatic Speech Recognition
•
Updated
•
2

pyf98/aishell_ctc_conformer_e15_linear1024
Automatic Speech Recognition
•
Updated
•
3
•
2
datasets
0
None public yet