Wav2Vec 2.0 - a facebook Collection

facebook 's Collections

Physics of Language Models: Part 4.2

Web-SSL

blt

Perception Encoder

DRAMA

Sparsh

Seamless Communication

MAGNeT

XLSR

XLS-R

Robust Wav2Vec 2.0

HuBERT

Fairseq S^2 TTS

Dinov2

MusicGen Stereo

Sapiens

OPT

FAIR's LayerSkip Llama models

Wav2Vec 2.0

updated Jan 16, 2024

A collection for the first release of Wav2Vec 2.0, a speech encoder that learns powerful representations from unlabelled audio data.

facebook/wav2vec2-large-960h-lv60-self

Automatic Speech Recognition • Updated May 23, 2022 • 58.6k • 150

Note The Wav2Vec 2.0 "large" model pre-trained on 53k hours of un-labelled audio data from the LibriSpeech and LibriVox (LV) corpora, and fine-tuned on 960 hours of LibriSpeech ASR data. This is the most performant Wav2Vec 2.0 checkpoint from the initial release, obtaining 1.9/3.9% WER on the LibriSpeech test clean/other subsets respectively.
facebook/wav2vec2-large-960h

Automatic Speech Recognition • Updated Apr 5, 2022 • 61.8k • 31

Note The Wav2Vec 2.0 "large" model pre-trained and fine-tuned on 960 hours of LibriSpeech ASR data.
facebook/wav2vec2-base-960h

Automatic Speech Recognition • 0.1B • Updated Nov 14, 2022 • 1.21M • 361

Note The Wav2Vec 2.0 "base" model pre-trained and fine-tuned on 960 hours of LibriSpeech ASR data.
facebook/wav2vec2-base-100h

Automatic Speech Recognition • Updated May 27, 2022 • 895 • 6

Note The Wav2Vec 2.0 "base" model pre-trained on 960 hours of un-labelled LibriSpeech ASR data, and fine-tuned on 100 hours of labelled LibriSpeech ASR data.
facebook/wav2vec2-large-lv60

Updated Dec 28, 2021 • 12k • 10

Note The Wav2Vec 2.0 "large" model pre-trained on 53k hours of un-labelled data from the LibriSpeech and LibriVox (LV) corpora.
facebook/wav2vec2-large

Updated Aug 26, 2022 • 3.89k • 7

Note The Wav2Vec 2.0 "large" model pre-trained on 960 hours of un-labelled LibriSpeech ASR data.
facebook/wav2vec2-base

Updated Dec 28, 2021 • 1.05M • 108

Note The Wav2Vec 2.0 "base" model pre-trained on 960 hours of un-labelled LibriSpeech ASR data.
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations

Paper • 2006.11477 • Published Jun 20, 2020 • 6

Note The wav2vec 2.0 paper, accepted to NeurIPS 2020.