ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-seed-bt Text Generation • 3B • Updated 19 days ago • 9
ZDCSlab/ripd-anthropic-saferlhf-gemma-2b-uncensored-v1-biased-bt Text Generation • 3B • Updated 19 days ago • 6
ZDCSlab/ripd-anthropic-saferlhf-dolphin3-llama31-8b-seed-bt Text Generation • Updated 18 days ago • 8
ZDCSlab/ripd-anthropic-saferlhf-dolphin3-llama31-8b-biased-bt Text Generation • Updated 18 days ago • 6
kuririrn/qwen3-4b-agent-trajectory-lora-sft_multi_dpo_merged Text Generation • 4B • Updated 12 days ago