QwenLong-L1 🔥 long-context reasoning model by Alibaba Tongyi Zhiwen team.
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning (2505.17667)
Tongyi-Zhiwen/QwenLong-L1-32B
✨ 32B & Apache 2.0
✨ Outperforms OpenAI-o3-mini & Qwen3-235B-A22B
✨ Trained on a unique 1.6K DocQA RL dataset spanning math, logic & multi-hop reasoning