Distilling LLM Agent into Small Models with Retrieval and Code Tools Paper • 2505.17612 • Published May 23 • 80
T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models Paper • 2504.04718 • Published Apr 7 • 41
Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks Paper • 2305.18395 • Published May 28, 2023 • 1
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models Paper • 2410.01524 • Published Oct 2, 2024 • 3
Latent Paraphrasing: Perturbation on Layers Improves Knowledge Injection in Language Models Paper • 2411.00686 • Published Nov 1, 2024