Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
12
3
Zhicheng YANG
yangzhch6
Follow
allanjie's profile picture
dark-pen's profile picture
2 followers
·
2 following
https://yangzhch6.github.io/
yangzhch6
AI & ML interests
reasoning with LLMs
Recent Activity
updated
a model
about 23 hours ago
yangzhch6/Qwen3-4B-n8-sglang-no_kl-grpo-0.5-1e-6-step340
updated
a model
about 23 hours ago
yangzhch6/Qwen3-4B-n8-sglang-no_kl-grpo-0.5-1e-6-step300
published
a model
about 23 hours ago
yangzhch6/Qwen3-4B-n8-sglang-no_kl-grpo-0.5-1e-6-step340
View all activity
Organizations
None yet
yangzhch6
's models
39
Sort: Recently updated
yangzhch6/Qwen3-4B-n8-sglang-no_kl-grpo-0.5-1e-6-step340
4B
•
Updated
about 23 hours ago
•
5
yangzhch6/Qwen3-4B-n8-sglang-no_kl-grpo-0.5-1e-6-step300
4B
•
Updated
about 23 hours ago
•
5
yangzhch6/mcpfactory-qwen3-8b-newreward-step340
8B
•
Updated
2 days ago
•
11
yangzhch6/mcpfactory-qwen3-8b-newreward-step300
8B
•
Updated
2 days ago
•
11
yangzhch6/mcpfactory-qwen3-1.7b-newreward-step300
2B
•
Updated
2 days ago
•
8
yangzhch6/mcpfactory-qwen3-1.7b-newreward-step340
2B
•
Updated
2 days ago
•
10
yangzhch6/tool-verl-qwen1.5B-200step-0320
2B
•
Updated
4 days ago
•
13
yangzhch6/maxrl-qwen3-4b-base-dapo-bs128-n16-stepp400
4B
•
Updated
10 days ago
•
14
yangzhch6/Qwen2.5-Math-7B-Think32k
Text Generation
•
8B
•
Updated
19 days ago
•
15
yangzhch6/Qwen2.5-Math-7B-Think32k-Openr1ColdStart46k-Syn
333k
•
Updated
19 days ago
•
13
yangzhch6/Qwen2.5-Math-7B-Think32k-Openr1ColdStart46k
333k
•
Updated
20 days ago
•
11
yangzhch6/Qwen2.5-Math-7B-16k-Think-Synthesizer
8B
•
Updated
Nov 10, 2025
•
1
yangzhch6/cuda-12.8-tar
Updated
Oct 13, 2025
yangzhch6/cuda-12.8
Updated
Oct 13, 2025
yangzhch6/Mirror-Verifier-1.5B
2B
•
Updated
Sep 30, 2025
yangzhch6/Mirror-Verifier-7B
8B
•
Updated
Sep 30, 2025
yangzhch6/Zero-Solver-Qwen2.5-Math-7B-L
8B
•
Updated
Sep 30, 2025
•
2
yangzhch6/Zero-Solver-Qwen2.5-Math-1.5B-L
2B
•
Updated
Sep 30, 2025
yangzhch6/Qwen2.5-Math-7B-L
Text Generation
•
8B
•
Updated
Sep 30, 2025
yangzhch6/Qwen2.5-7B-openr1-nothink-3k-f3
Updated
Sep 19, 2025
yangzhch6/Qwen2.5-1.5B-openr1-nothink-3k-f3
Updated
Sep 19, 2025
yangzhch6/mix-rlvr-verify
Updated
Sep 14, 2025
yangzhch6/DARS-Llama-HW-Breadth
8B
•
Updated
Sep 14, 2025
•
4
yangzhch6/Qwen2.5-Math-7B-L-openr1-nothink-3k-f3-step100
8B
•
Updated
Sep 7, 2025
yangzhch6/Qwen2.5-Math-1.5B-L-openr1-nothink-3k-f3-step100
2B
•
Updated
Sep 7, 2025
yangzhch6/Qwen2.5-Math-1.5B-L
Text Generation
•
2B
•
Updated
Sep 6, 2025
yangzhch6/Qwen2.5-7B-L-openr1-f3-ckpt500
8B
•
Updated
Sep 4, 2025
•
3
yangzhch6/Qwen2.5-1.5B-L-openr1-f3-ckpt500
2B
•
Updated
Sep 4, 2025
yangzhch6/DARS-Llama-ET-Breadth
8B
•
Updated
Sep 3, 2025
yangzhch6/DARS-7B-HW-Breadth
8B
•
Updated
Sep 1, 2025
Previous
1
2
Next