rStar-Coder: Scaling Competitive Code Reasoning with a Large-Scale Verified Dataset Paper • 2505.21297 • Published May 27 • 30
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper • 2503.01743 • Published Mar 3 • 88
Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization Paper • 2502.04295 • Published Feb 6 • 13
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 280
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models Paper • 2409.17066 • Published Sep 25, 2024 • 29
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers Paper • 2408.06195 • Published Aug 12, 2024 • 74
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published Apr 22, 2024 • 257
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper • 2402.13753 • Published Feb 21, 2024 • 117
Boosting LLM Reasoning: Push the Limits of Few-shot Learning with Reinforced In-Context Pruning Paper • 2312.08901 • Published Dec 14, 2023
Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models Paper • 2310.05015 • Published Oct 8, 2023 • 1
Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference Paper • 2306.14393 • Published Jun 26, 2023
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices Paper • 2303.09730 • Published Mar 17, 2023 • 1
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference Paper • 2303.08308 • Published Mar 15, 2023 • 1