arxiv:2501.12948
PEIYI, WANG
peiyiwang89
AI & ML interests
None yet
Recent Activity
authored
a paper
5 days ago
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via
Reinforcement Learning
authored
a paper
5 months ago
Towards a Unified View of Preference Learning for Large Language Models:
A Survey
authored
a paper
7 months ago
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code
Intelligence
Organizations
None yet
models
1
datasets
None public yet