Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
16
56
MC
Dreamer312
Follow
Dreamer
AI & ML interests
NLP, CV, LLM, AGENT, RL
Recent Activity
commented
on
a paper
about 22 hours ago
SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization
commented
on
a paper
about 22 hours ago
SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization
upvoted
a
paper
2 days ago
Scaling Law for Quantization-Aware Training
View all activity
Organizations
None yet
Papers
1
arxiv:
2409.10262
models
2
Sort: Recently updated
Dreamer312/Qwen-2.5-1.5B-Simple-RL
Updated
15 days ago
•
1
Dreamer312/Qwen-2.5-7B-Simple-RL
Updated
17 days ago
datasets
0
None public yet