Hugging Face
Jiaxin Qin
JiaxinQin-cc
0 followers · 1 following
https://jiaxinqin0814.github.io/
JiaxinQin0814
AI & ML interests
Natural Language Processing, Reinforcement Learning
Recent Activity
Upvoted a paper 4 days ago: MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning
Upvoted a paper 5 days ago: AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Upvoted a paper 12 days ago: s3: You Don't Need That Much Data to Train a Search Agent via RL
Organizations
None yet
Models (1)
JiaxinQin-cc/MiniGrid-DistShift1-v0 • Updated Sep 15, 2023 • 1
Datasets (1)
JiaxinQin-cc/Offline-RL-MiniGrid • Updated Sep 16, 2023 • 9