Hugging Face
Jixuan Leng
PRO
JixuanLeng
0 followers · 9 following
AI & ML interests
None yet
Organizations
JixuanLeng's activity
updated a model about 5 hours ago
RLProj/qwen2.5-7b-gspo • 8B • Updated about 5 hours ago
updated a collection about 5 hours ago
GRPO_Models • Collection • 7 items • Updated about 5 hours ago
published a model about 5 hours ago
RLProj/qwen2.5-7b-gspo • 8B • Updated about 5 hours ago
updated a model about 7 hours ago
ConfRL/Qwen2.5-7B-Confidence-SFT • 8B • Updated about 7 hours ago • 3.78k
updated a dataset 1 day ago
ConfRL/General-RL-Collection • Viewer • Updated 1 day ago • 63.9k • 48
updated a collection 3 days ago
GRPO_Models • Collection • 7 items • Updated about 5 hours ago
updated a model 3 days ago
RLProj/qwen2.5-7b-grpo-entropy-shaping • 8B • Updated 3 days ago • 165
published a model 3 days ago
RLProj/qwen2.5-7b-grpo-entropy-shaping • 8B • Updated 3 days ago • 165
updated a dataset 5 days ago
ConfRL/General-RL-Collection-Sampled • Viewer • Updated 5 days ago • 607k • 49
published 2 datasets 5 days ago
ConfRL/General-RL-Collection-Sampled • Viewer • Updated 5 days ago • 607k • 49
ConfRL/General-RL-Collection • Viewer • Updated 1 day ago • 63.9k • 48
updated a dataset 6 days ago
RLProj/math-rl-collection • Viewer • Updated 6 days ago • 58.1k • 5
published a dataset 6 days ago
RLProj/math-rl-collection • Viewer • Updated 6 days ago • 58.1k • 5
updated a dataset 7 days ago
ConfRL/math-rl-collection-sampled • Viewer • Updated 7 days ago • 12.1M • 59
updated a model 7 days ago
RLProj/qwen2.5-7b-dapo-baseline • 8B • Updated 7 days ago • 479
updated a collection 7 days ago
GRPO_Models • Collection • 7 items • Updated about 5 hours ago
published a model 7 days ago
RLProj/qwen2.5-7b-dapo-baseline • 8B • Updated 7 days ago • 479
updated a model 10 days ago
RLProj/qwen2.5-7b-grpo-trainable-cb • 8B • Updated 10 days ago • 717
updated a collection 10 days ago
GRPO_Models • Collection • 7 items • Updated about 5 hours ago
published a model 10 days ago
RLProj/qwen2.5-7b-grpo-trainable-cb • 8B • Updated 10 days ago • 717