How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 108
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 • 165
ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition Paper • 2210.13352 • Published Oct 24, 2022 • 3
Multi-Span Acoustic Modelling using Raw Waveform Signals Paper • 1906.11047 • Published Jun 21, 2019 • 1
Latent Consistency Models LoRAs Collection Latent Consistency Models for Stable Diffusion - LoRAs and full fine-tuned weights • 4 items • Updated Nov 10, 2023 • 99
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling Paper • 2311.00430 • Published Nov 1, 2023 • 56
DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics Paper • 2310.13268 • Published Oct 20, 2023 • 17
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference Paper • 2310.04378 • Published Oct 6, 2023 • 19
Short-Form Test Sets Collection A collection of short-form (samples < 30s) datasets used to evaluate the Distil-Whisper models. • 4 items • Updated Mar 21 • 3
Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion Paper • 2310.03502 • Published Oct 5, 2023 • 77
DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models Paper • 2211.01095 • Published Nov 2, 2022 • 1
HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models Paper • 2307.06949 • Published Jul 13, 2023 • 50
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis Paper • 2307.01952 • Published Jul 4, 2023 • 82