Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
44
11
taicheng guo
taicheng
Follow
tahamajs's profile picture
Mi6paulino's profile picture
lx865712528's profile picture
10 followers
·
57 following
AI & ML interests
None yet
Recent Activity
liked
a model
14 days ago
Qwen/Qwen3-0.6B
upvoted
a
paper
17 days ago
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
upvoted
a
paper
20 days ago
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
View all activity
Organizations
Papers
2
arxiv:
2402.01680
arxiv:
2305.18365
models
46
Sort: Recently updated
taicheng/zephyr-7b-align-scan-0.0-0.0-linear-1
Text Generation
•
Updated
Sep 28, 2024
•
14
taicheng/zephyr-7b-align-scan-0.0-0.0-polynomial-1
Text Generation
•
Updated
Sep 28, 2024
•
13
taicheng/zephyr-7b-align-scan-0.0-0.0-cosine-2
Text Generation
•
Updated
Sep 28, 2024
•
14
taicheng/zephyr-7b-align-scan-0.0-0.0-polynomial-2
Text Generation
•
Updated
Sep 28, 2024
•
13
taicheng/zephyr-7b-align-scan-0.0-0.0-polynomial-3
Text Generation
•
Updated
Sep 28, 2024
•
12
taicheng/zephyr-7b-align-scan-0.0-0.0-linear-3
Text Generation
•
Updated
Sep 28, 2024
•
13
taicheng/zephyr-7b-align-scan
Text Generation
•
Updated
Sep 28, 2024
•
14
taicheng/zephyr-7b-align-scan-1e-07-0.27-polynomial-1.0
Updated
Sep 28, 2024
taicheng/zephyr-7b-align-scan-7e-07-0.45-cosine-3.0
Text Generation
•
Updated
Sep 28, 2024
•
13
taicheng/zephyr-7b-align-scan-6e-07-0.53-polynomial-2.0
Text Generation
•
Updated
Sep 28, 2024
•
15
View 46 models
datasets
0
None public yet