Model Merging in Pre-training of Large Language Models Paper • 2505.12082 • Published 7 days ago • 34
AdaptThink: Reasoning Models Can Learn When to Think Paper • 2505.13417 • Published 5 days ago • 70
CFSP: An Efficient Structured Pruning Framework for LLMs with Coarse-to-Fine Activation Information Paper • 2409.13199 • Published Sep 20, 2024
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Paper • 2501.09686 • Published Jan 16 • 41
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published Apr 14 • 260