HuBERT - a facebook Collection

facebook 's Collections

DINOv3

Physics of Language Models: Part 4.2

Web-SSL

blt

Perception Encoder

DRAMA

Sparsh

Seamless Communication

MAGNeT

XLSR

XLS-R

Robust Wav2Vec 2.0

HuBERT

Fairseq S^2 TTS

DINOv2

MusicGen Stereo

Sapiens

OPT

FAIR's LayerSkip Llama models

HuBERT

updated Jan 16, 2024

A collection of checkpoints from the HuBERT release, a speech encoder that learns powerful representations from unlabelled audio data.

HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units

Paper • 2106.07447 • Published Jun 14, 2021 • 4

Note The HuBERT paper, accepted at IEEE/ACM Transactions on Audio, Speech and Language Processing Volume 29.
facebook/hubert-base-ls960

Feature Extraction • Updated Nov 5, 2021 • 298k • • 61

Note The "base" HuBERT model fine-tuned on 960 hours of LibriSpeech ASR data.
facebook/hubert-large-ll60k

Feature Extraction • Updated Nov 5, 2021 • 31.5k • 30

Note The "large" HuBERT model pre-trained on LibriVox 60k hours.
facebook/hubert-large-ls960-ft

Automatic Speech Recognition • Updated May 24, 2022 • 342k • 74

Note A fine-tuned version of hubert-large-ll60k, fine-tuned on 960 hours of LibriSpeech ASR data.
facebook/hubert-xlarge-ll60k

Feature Extraction • Updated Oct 20, 2021 • 545 • 5

Note The "extra-large" HuBERT model pre-trained on LibriVox 60k hours.
facebook/hubert-xlarge-ls960-ft

Automatic Speech Recognition • 1.0B • Updated Jun 27, 2023 • 3.98k • 14

Note A fine-tuned version of hubert-xlarge-ll60k, fine-tuned on 960 hours of LibriSpeech ASR data. This is the most performant HuBERT checkpoint in the release, achieving a WER of 1.8/2.9% on the LibriSpeech test clean/other subsets respectively.