LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers Paper • 2507.04404 • Published 10 days ago • 19
BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity Paper • 2507.08771 • Published 5 days ago • 7
MetaStone-S1 Collection The open-source model of MetaStone-S1. • 3 items • Updated 12 days ago • 9
Test-Time Scaling with Reflective Generative Model Paper • 2507.01951 • Published 14 days ago • 84
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs Paper • 2507.07996 • Published 6 days ago • 28 • 9
🧠SmolLM3 Collection Smol, multilingual, long-context reasoner • 10 items • Updated 6 days ago • 59
Dynamic Chunking for End-to-End Hierarchical Sequence Modeling Paper • 2507.07955 • Published 6 days ago • 15
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs Paper • 2507.07996 • Published 6 days ago • 28 • 9
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs Paper • 2507.07996 • Published 6 days ago • 28
IFBench Collection Datasets for IFBench benchmark and paper! • 3 items • Updated 13 days ago • 1
Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search Paper • 2507.02652 • Published 13 days ago • 23
Rewarding the Unlikely: Lifting GRPO Beyond Distribution Sharpening Paper • 2506.02355 • Published Jun 3 • 1
Bridging Offline and Online Reinforcement Learning for LLMs Paper • 2506.21495 • Published 20 days ago • 1