One-Step Diffusion Distillation via Deep Equilibrium Models Paper • 2401.08639 • Published Dec 12, 2023
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads Paper • 2401.10774 • Published Jan 19, 2024