Jin Zhu
mamba413
AI & ML interests
reinforcement learning
Recent Activity
liked
a model
13 days ago
mistralai/Mistral-7B-v0.1
updated
a dataset
29 days ago
mamba413/GenerateText_Qwen2.5-1.5B-Instruct_GRPO_HH_Seed1
published
a dataset
about 1 month ago
mamba413/GenerateText_Qwen2.5-1.5B-Instruct_GRPO_HH_Seed1
Organizations
None yet