Taming the Titans: A Survey of Efficient LLM Inference Serving Paper • 2504.19720 • Published 29 days ago • 10
A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More Paper • 2407.16216 • Published Jul 23, 2024
100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models Paper • 2505.00551 • Published 26 days ago • 36
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Paper • 2505.04921 • Published 20 days ago • 147