Zheng Zhang

qpz

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery

Paper • 2508.08401 • Published 14 days ago • 39

authored a paper 19 days ago

Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning

Paper • 2504.13914 • Published Apr 10 • 4

commented on a paper 4 months ago

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Paper • 2504.11343 • Published Apr 15 • 19

upvoted a paper 5 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 137

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-32B 7 months ago

Generate crashed by repeatedly generating <think>

#35 opened 7 months ago by

liked a model almost 2 years ago

CofeAI/FLM-101B

Text Generation • Updated Sep 18, 2023 • 6 • 91

updated 4 models over 2 years ago

ConvLab/gpt2-medium-nlg-tm1_tm2_tm3

Updated Dec 26, 2022

ConvLab/gpt2-medium-nlg-multiwoz21

Updated Dec 26, 2022

ConvLab/gpt2-medium-nlg-multiwoz21_sgd_tm1_tm2_tm3

Updated Dec 26, 2022

ConvLab/gpt2-medium-nlg-sgd

Updated Dec 26, 2022