PavelMB/mistral-7b-instruct-v0.3-h-expert-v1 Text Generation • 4B • Updated about 23 hours ago • 7 • 1
PavelMB/mistral-7b-instruct-v0.3-h-expert-v2 Text Generation • 4B • Updated about 22 hours ago • 3 • 1
dshin/flan-t5-ppo-user-h-batch-size-8-epoch-0-use-violation Reinforcement Learning • Updated Mar 13, 2023 • 16
dshin/flan-t5-ppo-user-f-batch-size-8-epoch-0-use-violation Reinforcement Learning • Updated Mar 13, 2023 • 13
dshin/flan-t5-ppo-user-e-batch-size-8-epoch-0-use-violation Reinforcement Learning • Updated Mar 13, 2023 • 47
dshin/flan-t5-ppo-user-f-batch-size-8-epoch-1-use-violation Reinforcement Learning • Updated Mar 13, 2023 • 15
dshin/flan-t5-ppo-user-h-batch-size-8-epoch-1-use-violation Reinforcement Learning • Updated Mar 13, 2023 • 15