Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
4
2
yuxuanxie
yuxuan99
Follow
0 followers
ยท
6 following
AI & ML interests
None yet
Recent Activity
reacted
to
Jaward
's
post
with ๐ค
about 21 hours ago
nice clean GRPO implementation: - no transformers - no vllm - has improved grpo (DAPO) - under 300 lines - runs on 24GB (RTX 4090 GPU) Code: https://github.com/policy-gradient/GRPO-Zero
replied
to
Jaward
's
post
about 23 hours ago
nice clean GRPO implementation: - no transformers - no vllm - has improved grpo (DAPO) - under 300 lines - runs on 24GB (RTX 4090 GPU) Code: https://github.com/policy-gradient/GRPO-Zero
View all activity
Organizations
yuxuan99
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
upvoted
a
collection
2 months ago
Deepseek Papers
Collection
Deepseek papers collection
โข
19 items
โข
Updated
18 days ago
โข
190
upvoted
an
article
11 months ago
view article
Article
Train a Sentence Embedding Model with 1B Training Pairs
Oct 25, 2021
โข
1