FFN Fusion: Rethinking Sequential Computation in Large Language Models Paper • 2503.18908 • Published 13 days ago • 17
FFN Fusion: Rethinking Sequential Computation in Large Language Models Paper • 2503.18908 • Published 13 days ago • 17
FFN Fusion: Rethinking Sequential Computation in Large Language Models Paper • 2503.18908 • Published 13 days ago • 17
FFN Fusion: Rethinking Sequential Computation in Large Language Models Paper • 2503.18908 • Published 13 days ago • 17
FFN Fusion: Rethinking Sequential Computation in Large Language Models Paper • 2503.18908 • Published 13 days ago • 17
Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models Paper • 2501.06751 • Published Jan 12 • 33
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Paper • 2411.19146 • Published Nov 28, 2024 • 17
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Paper • 2411.19146 • Published Nov 28, 2024 • 17
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Paper • 2411.19146 • Published Nov 28, 2024 • 17
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Paper • 2411.19146 • Published Nov 28, 2024 • 17
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Paper • 2411.19146 • Published Nov 28, 2024 • 17