dshin
AI & ML interests
None yet
Organizations
None yet
Models: 58
dshin/flan-t5-ppo-user-a-allenai-prosocial-dialog
Reinforcement Learning
18
dshin/flan-t5-ppo-user-e-allenai-prosocial-dialog
Reinforcement Learning
14
dshin/flan-t5-ppo-user-h-allenai-prosocial-dialog
Reinforcement Learning
29
dshin/flan-t5-ppo-user-f-allenai-prosocial-dialog
Reinforcement Learning
28
dshin/flan-t5-ppo-user-a-allenai-prosocial-dialog-testing-upload
Reinforcement Learning
13
dshin/flan-t5-ppo-user-e-batch-size-64
Reinforcement Learning
38
dshin/flan-t5-ppo-user-e-batch-size-64-use-violation
Reinforcement Learning
28
dshin/flan-t5-ppo-user-h-batch-size-64-use-violation
Reinforcement Learning
59
dshin/flan-t5-ppo-user-f-batch-size-64-use-violation
Reinforcement Learning
16
dshin/flan-t5-ppo-user-f-batch-size-64
Reinforcement Learning
15