Hugging Face

yuxuanxie

yuxuan99

AI & ML interests

None yet

Recent Activity

reacted to Jaward's post with 🤗 about 1 month ago
nice clean GRPO implementation:
- no transformers
- no vllm
- has improved grpo (DAPO)
- under 300 lines
- runs on 24GB (RTX 4090 GPU)
Code: https://github.com/policy-gradient/GRPO-Zero
replied to Jaward's post about 1 month ago
nice clean GRPO implementation:
- no transformers
- no vllm
- has improved grpo (DAPO)
- under 300 lines
- runs on 24GB (RTX 4090 GPU)
Code: https://github.com/policy-gradient/GRPO-Zero
upvoted an article 3 months ago
Open R1: Update #3

Organizations

Hugging Face Discord Community

models 0

None public yet

datasets 0

None public yet