Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
espnet 's Collections
ARECHO Series
OpusLM
UniVERSA
Codec Survey - Pre-trained Models
OWSM: Fully Open Speech Recognition and Translation Models
OWLS: Scaling Laws for Speech Recognition and Translation
OWSM-CTC: Ultra-Fast Speech Foundation Models
Neural Codecs
XEUS Model and Data

OWLS: Scaling Laws for Speech Recognition and Translation

updated May 3

🦉 A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. 16k sampling rate.

Upvote
7

  • espnet/owls_4B_180K

    Automatic Speech Recognition • Updated May 3 • 16 • 5

  • espnet/owls_9B_180K

    Automatic Speech Recognition • Updated May 3 • 18

  • espnet/owls_05B_180K

    Automatic Speech Recognition • Updated May 3 • 3

  • espnet/owls_025B_180K

    Automatic Speech Recognition • Updated May 3 • 6

  • espnet/owls_1B_180K

    Automatic Speech Recognition • Updated May 3 • 8 • 3

  • espnet/owls_2B_180K

    Automatic Speech Recognition • Updated May 3 • 6

  • espnet/owls_18B_180K

    Automatic Speech Recognition • Updated May 3 • 4 • 1

  • espnet/owls_18B_360K

    Automatic Speech Recognition • Updated May 3 • 9 • 1
Upvote
7
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs