AdversarialRLHF/pythia410m-rm-tldr6.9b_randomizeprefix • Text Classification • 0.4B • Updated Apr 25 • 5
AdversarialRLHF/pythia410m-rm-tldr6.9b_logprobcondprefix • Text Classification • 0.4B • Updated Apr 26 • 5
AdversarialRLHF/pythia410m-rm-tldr6.9b_logprobcondsuffix • Text Classification • 0.4B • Updated Apr 26 • 4
AdversarialRLHF/pythia410m-rm-tldr6.9b_logprobcondboth • Text Classification • 0.4B • Updated Apr 26 • 4
AdversarialRLHF/pythia410m-rm-tldr6.9b_logprobcondallprefix • Text Classification • 0.4B • Updated Apr 27 • 5
AdversarialRLHF/pythia410m-rm-tldr6.9b_logprobcondpropallprefix • Text Classification • 0.4B • Updated Apr 27 • 5
AdversarialRLHF/pythia410m-rm-tldr6.9b_logprobcondpropprefix • Text Classification • 0.4B • Updated Apr 27 • 4
AdversarialRLHF/pythia410m-rm-tldr6.9b_prefix_in_chosen • Text Classification • 0.4B • Updated Apr 30 • 7