Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published 24 days ago • 161
NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search Paper • 2505.14680 • Published May 20 • 9
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Paper • 2505.04921 • Published May 8 • 177
Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation Paper • 2503.22675 • Published Mar 28 • 35
Length-Induced Embedding Collapse in Transformer-based Models Paper • 2410.24200 • Published Oct 31, 2024 • 2
Perplexity Trap: PLM-Based Retrievers Overrate Low Perplexity Documents Paper • 2503.08684 • Published Mar 11 • 5