Speculative MoE: Communication Efficient Parallel MoE Inference with Speculative Token and Expert Pre-scheduling Paper • 2503.04398 • Published Mar 6 • 1