Reinforcement Learning for Large Language Models: Beyond the Agent Paradigm By royswastik • Mar 19