Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers • Paper • 2504.20752 • Published 25 days ago • 90
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math • Paper • 2504.21233 • Published 25 days ago • 43
AF Adapter: Continual Pretraining for Building Chinese Biomedical Language Model • Paper • 2211.11363 • Published Nov 21, 2022 • 1
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning • Paper • 2405.12130 • Published May 20, 2024 • 51
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning • Paper • 2403.17919 • Published Mar 26, 2024 • 16
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection • Paper • 2403.03507 • Published Mar 6, 2024 • 189
ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models • Paper • 2403.16187 • Published Mar 24, 2024
Absolute Zero: Reinforced Self-play Reasoning with Zero Data • Paper • 2505.03335 • Published 18 days ago • 158
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning • Paper • 2505.16400 • Published 2 days ago • 19
Model Merging in Pre-training of Large Language Models • Paper • 2505.12082 • Published 7 days ago • 34
A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone • Paper • 2505.12781 • Published 5 days ago • 2