
OWSM-CTC: Ultra-Fast Speech Foundation Models

updated Mar 8

CTC-based models from the OWSM project, designed for fast non-autoregressive inference: https://www.wavlab.org/activities/2024/owsm/
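To illustrate why CTC output can be produced without autoregressive decoding, here is a minimal sketch of generic greedy CTC decoding: each frame's label is chosen independently, consecutive repeats are merged, and blanks are dropped. This is not the OWSM-CTC inference code. The function name `ctc_greedy_decode`, the choice of `blank_id=0`, and the toy scores are all assumptions made for illustration.

```python
from itertools import groupby


def ctc_greedy_decode(frame_scores, blank_id=0):
    """Collapse per-frame argmax labels into a CTC label sequence.

    frame_scores: list of per-frame score lists, one score per vocabulary entry.
    blank_id: index of the CTC blank symbol (assumed to be 0 in this sketch).
    """
    # Pick the best label for every frame independently; no frame depends on
    # a previously emitted token, which is what makes this non-autoregressive.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]
    # Merge consecutive repeated labels, then remove blanks.
    collapsed = [label for label, _ in groupby(best_path)]
    return [label for label in collapsed if label != blank_id]


# Hypothetical scores: 6 frames over a 3-symbol vocabulary (index 0 = blank).
toy_scores = [
    [0.10, 0.80, 0.10],  # label 1
    [0.20, 0.70, 0.10],  # label 1 (repeat, merged with the previous frame)
    [0.90, 0.05, 0.05],  # blank (separates the two occurrences of label 1)
    [0.10, 0.80, 0.10],  # label 1
    [0.10, 0.20, 0.70],  # label 2
    [0.80, 0.10, 0.10],  # blank
]

print(ctc_greedy_decode(toy_scores))  # [1, 1, 2]
```

Because every frame is labeled in one parallel pass, decoding cost does not grow with a token-by-token loop, which is the general property the collection description refers to as fast non-autoregressive inference.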


  • espnet/owsm_ctc_v3.2_ft_1B

    Automatic Speech Recognition • Updated 5 days ago • 64 • 4

  • espnet/owsm_ctc_v3.1_1B

    Automatic Speech Recognition • Updated 5 days ago • 54 • 13