Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published Apr 29 • 97
view post Post 2738 > New Model> Looks at Model Card> "Open-Weights" See translation 1 reply · 🔥 13 13 + Reply
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated May 1 • 461k • 1.48k