Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Jiafei Lyu
dmux
Follow
BryantMcGill's profile picture
21world's profile picture
2 followers
ยท
1 following
https://dmksjfl.github.io/
dmksjfl
AI & ML interests
Reinforcement Learning
Recent Activity
authored
a paper
1 day ago
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning
authored
a paper
11 months ago
SEABO: A Simple Search-Based Method for Offline Imitation Learning
authored
a paper
over 1 year ago
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model
View all activity
Organizations
Papers
3
arxiv:
2504.00891
arxiv:
2402.03807
arxiv:
2311.13231
models
None public yet
datasets
None public yet