Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
4
2
yuxuanxie
yuxuan99
Follow
0 followers
ยท
6 following
AI & ML interests
None yet
Recent Activity
reacted
to
Jaward
's
post
with ๐ค
about 18 hours ago
nice clean GRPO implementation: - no transformers - no vllm - has improved grpo (DAPO) - under 300 lines - runs on 24GB (RTX 4090 GPU) Code: https://github.com/policy-gradient/GRPO-Zero
replied
to
Jaward
's
post
about 20 hours ago
nice clean GRPO implementation: - no transformers - no vllm - has improved grpo (DAPO) - under 300 lines - runs on 24GB (RTX 4090 GPU) Code: https://github.com/policy-gradient/GRPO-Zero
View all activity
Organizations
models
None public yet
datasets
None public yet