wooihen
AI & ML interests
machine learning, NLP, computer vision and RL
Recent Activity
Organizations
-
-
-
-
-
-
-
-
-
-
-
view article
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
view article
How to deploy and fine-tune DeepSeek models on AWS
upvoted
an
article
about 1 year ago
view article
How we leveraged distilabel to create an Argilla 2.0 Chatbot