naver-hyperclovax/HyperCLOVAX-SEED-Vision-Instruct-3B • Text Generation • Updated 3 days ago • 18.1k • 162
nvidia/Llama-3.1-Nemotron-8B-UltraLong-4M-Instruct • Text Generation • Updated 23 days ago • 5.6k • 105
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? • Paper • 2504.13837 • Published 21 days ago • 121