Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published 11 days ago • 34 • 6
Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published 11 days ago • 34 • 6
Omnilingual MT: Machine Translation for 1,600 Languages Paper • 2603.16309 • Published 26 days ago • 21 • 5
TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning Paper • 2603.12529 • Published about 1 month ago • 19 • 3
Accent Vector: Controllable Accent Manipulation for Multilingual TTS Without Accented Data Paper • 2603.07534 • Published Mar 8 • 5 • 3
Yor-Sarc: A gold-standard dataset for sarcasm detection in a low-resource African language Paper • 2602.18964 • Published Feb 21 • 1 • 4
Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better Paper • 2602.05393 • Published Feb 5 • 8 • 3
Chronicals: A High-Performance Framework for LLM Fine-Tuning with 3.51x Speedup over Unsloth Paper • 2601.02609 • Published Jan 6 • 2 • 2
EPAS: Efficient Training with Progressive Activation Sharing Paper • 2601.19089 • Published Jan 27 • 1 • 1
Chronicals: A High-Performance Framework for LLM Fine-Tuning with 3.51x Speedup over Unsloth Paper • 2601.02609 • Published Jan 6 • 2 • 2
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning Paper • 2601.09088 • Published Jan 14 • 63 • 6