***'s picture

1

***

free126

AI & ML interests

None yet

Recent Activity

updated a model 10 days ago

free126/Qwen2-0.5B-GRPO-test

published a model 10 days ago

free126/Qwen2-0.5B-GRPO-test

commented on an article 11 days ago

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

View all activity

Organizations

None yet

free126's activity

updated a model 10 days ago

free126/Qwen2-0.5B-GRPO-test

Updated 10 days ago

published a model 10 days ago

free126/Qwen2-0.5B-GRPO-test

Updated 10 days ago

commented on From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning 11 days ago

你就纯摆论文没点自己的理解和解释发什么文章？

updated a model 10 months ago

free126/OrpoLlama-3-8B

Updated May 7, 2024

New activity in timpal0l/mdeberta-v3-base-squad2 about 1 year ago

This model seems to perform poorly in Chinese

#5 opened about 1 year ago by

updated a model about 1 year ago

free126/bert-finetuned-squad

Question Answering • Updated Feb 21, 2024 • 112