Baohao Liao

baohao

AI & ML interests

NLP

Recent Activity

Organizations

RWTH Aachen University, University of Amsterdam

baohao's activity

New activity in deepseek-ai/DeepSeek-R1-0528-Qwen3-8B 7 days ago

Model collapse after SFT

3
#14 opened 11 days ago by Banjiuyufen
upvoted an article 3 months ago

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

By NormalUhr • 43
New activity in Qwen/QwQ-32B 3 months ago

missing opening <think>

20
#4 opened 3 months ago by getfit
New activity in cognitivecomputations/DeepSeek-R1-AWQ 4 months ago

Deployment framework

27
#2 opened 5 months ago by xro7
updated a model 10 months ago