Zhihong Shao's picture

4

Zhihong Shao

ZhihongShao

·

https://zhihongshao.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

authored a paper 3 days ago

DeepSeek-V3 Technical Report

authored a paper 6 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

View all activity

Organizations

ZhihongShao's activity

authored 2 papers 3 days ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 82

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published Dec 27, 2024 • 40

authored a paper 6 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 7 days ago • 261

authored 2 papers 6 months ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15, 2024 • 53

Learning Task Decomposition to Assist Humans in Competitive Programming

Paper • 2406.04604 • Published Jun 7, 2024 • 4

authored a paper 7 months ago

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 60

authored a paper 8 months ago

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23, 2024 • 37

New activity in deepseek-ai/deepseek-math-7b-rl 11 months ago

Upload generation_config.json

#2 opened 12 months ago by

missing generation_config.json

#1 opened 12 months ago by

Should I use CoT prompting in RL model as instruction-tuned model?

#3 opened 11 months ago by

Should I use CoT prompting in RL model as instruction-tuned model?

#3 opened 11 months ago by