Reasoning in Latent Space Training Large Language Models to Reason in a Continuous Latent Space Paper β’ 2412.06769 β’ Published Dec 9, 2024 β’ 87
Training Large Language Models to Reason in a Continuous Latent Space Paper β’ 2412.06769 β’ Published Dec 9, 2024 β’ 87
MTP Hydra: Sequentially-Dependent Draft Heads for Medusa Decoding Paper β’ 2402.05109 β’ Published Feb 7, 2024 β’ 1 Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads Paper β’ 2401.10774 β’ Published Jan 19, 2024 β’ 59 Better & Faster Large Language Models via Multi-token Prediction Paper β’ 2404.19737 β’ Published Apr 30, 2024 β’ 78 Exploring the Latent Capacity of LLMs for One-Step Text Generation Paper β’ 2505.21189 β’ Published May 27 β’ 62
Hydra: Sequentially-Dependent Draft Heads for Medusa Decoding Paper β’ 2402.05109 β’ Published Feb 7, 2024 β’ 1
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads Paper β’ 2401.10774 β’ Published Jan 19, 2024 β’ 59
Better & Faster Large Language Models via Multi-token Prediction Paper β’ 2404.19737 β’ Published Apr 30, 2024 β’ 78
Exploring the Latent Capacity of LLMs for One-Step Text Generation Paper β’ 2505.21189 β’ Published May 27 β’ 62
Reasoning in Latent Space Training Large Language Models to Reason in a Continuous Latent Space Paper β’ 2412.06769 β’ Published Dec 9, 2024 β’ 87
Training Large Language Models to Reason in a Continuous Latent Space Paper β’ 2412.06769 β’ Published Dec 9, 2024 β’ 87
MTP Hydra: Sequentially-Dependent Draft Heads for Medusa Decoding Paper β’ 2402.05109 β’ Published Feb 7, 2024 β’ 1 Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads Paper β’ 2401.10774 β’ Published Jan 19, 2024 β’ 59 Better & Faster Large Language Models via Multi-token Prediction Paper β’ 2404.19737 β’ Published Apr 30, 2024 β’ 78 Exploring the Latent Capacity of LLMs for One-Step Text Generation Paper β’ 2505.21189 β’ Published May 27 β’ 62
Hydra: Sequentially-Dependent Draft Heads for Medusa Decoding Paper β’ 2402.05109 β’ Published Feb 7, 2024 β’ 1
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads Paper β’ 2401.10774 β’ Published Jan 19, 2024 β’ 59
Better & Faster Large Language Models via Multi-token Prediction Paper β’ 2404.19737 β’ Published Apr 30, 2024 β’ 78
Exploring the Latent Capacity of LLMs for One-Step Text Generation Paper β’ 2505.21189 β’ Published May 27 β’ 62