From LLM to Conversational Agent: A Memory Enhanced Architecture with Fine-Tuning of Large Language Models Paper • 2401.02777 • Published Jan 5, 2024 • 1
1.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training Paper • 2503.19633 • Published Mar 25
How Difficulty-Aware Staged Reinforcement Learning Enhances LLMs' Reasoning Capabilities: A Preliminary Experimental Study Paper • 2504.00829 • Published Apr 1
Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model Capability Paper • 2504.09639 • Published Apr 13
AM-Thinking-v1: Advancing the Frontier of Reasoning at 32B Scale Paper • 2505.08311 • Published May 13 • 16
Not All Correct Answers Are Equal: Why Your Distillation Source Matters Paper • 2505.14464 • Published May 20 • 8
Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking Paper • 2503.19855 • Published Mar 25 • 28
DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training Paper • 2504.17565 • Published Apr 24 • 1
BELLE-2/Belle-whisper-large-v3-zh-punct Automatic Speech Recognition • 2B • Updated Apr 16 • 2.54k • 39
BELLE-2/Belle-whisper-large-v3-zh Automatic Speech Recognition • 2B • Updated Dec 16, 2024 • 2.38k • 113