Perusha Moodley
moodlep
·
AI & ML interests
RL, DRL, Decision Transformers, Auxiliary signals, self-supervised methods
Recent Activity
liked
a dataset
1 day ago
Anthropic/hh-rlhf
upvoted
a
paper
13 days ago
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs
updated
a model
20 days ago
moodlep/smollm2-17b-dpo-cai-v1
Organizations
Collections
1
models
9
moodlep/smollm2-17b-dpo-cai-v1
Updated
•
8
moodlep/smollm2-1.7b-instr-sft-cai-v1
Updated
moodlep/smollm2-1.7b-instr-sft-cai
Updated
•
16
moodlep/mistral-7b-sft-constitutional-ai
Updated
•
6
moodlep/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
moodlep/output
Updated
moodlep/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated
•
4
moodlep/ppo-Huggy
Reinforcement Learning
•
Updated
•
24
moodlep/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
5