Hugging Face
free126
AI & ML interests
None yet
Recent Activity
updated
a model
10 days ago
free126/Qwen2-0.5B-GRPO-test
published
a model
10 days ago
free126/Qwen2-0.5B-GRPO-test
commented on an article
11 days ago
From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning
Organizations
None yet
free126's activity
updated
a model
10 days ago
free126/Qwen2-0.5B-GRPO-test
Updated
10 days ago
published
a model
10 days ago
free126/Qwen2-0.5B-GRPO-test
Updated
10 days ago
commented on
From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning
11 days ago
If you are just laying out the paper with none of your own understanding or explanation, why publish an article at all?
updated
a model
10 months ago
free126/OrpoLlama-3-8B
Updated
May 7, 2024
New activity in
timpal0l/mdeberta-v3-base-squad2
about 1 year ago
This model seems to perform poorly in Chinese
#5 opened about 1 year ago by
free126
updated
a model
about 1 year ago
free126/bert-finetuned-squad
Question Answering • Updated Feb 21, 2024 • 112