ldwang

ldwang

AI & ML interests

LLM, MLLM, Infra

Recent Activity

upvoted a collection about 21 hours ago
SimpleRL
updated a collection 3 days ago
MiscModels
liked a model 3 days ago
deepseek-ai/deepseek-vl2-tiny
View all activity

Organizations

Beijing Academy of Artificial Intelligence's profile picture PetiteTech's profile picture

ldwang's activity

upvoted an article 13 days ago
view article
Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

By NormalUhr
10
upvoted an article 26 days ago
view article
Article

Large-scale Near-deduplication Behind BigCode

22
upvoted an article 2 months ago
view article
Article

Low Latency CPU Based Educational Value Classifier With Generic Educational Value

By kenhktsui
9
upvoted an article 2 months ago
view article
Article

LLM数据工程3——数据收集魔法:获取顶级训练数据的方法

By JessyTsu1
17