Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published Apr 14 • 94
UltraData Collection Ultra Scale, Ultra Quality, Ultra Coverage • 10 items • Updated 2 days ago • 82
InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation Paper • 2509.24663 • Published Sep 29, 2025 • 16