0.01_4iters_bs256_nodpo_full6w_userresponse_iter_1 / model-00001-of-00003.safetensors

Commit History

Model save
5f2fcf8
verified

ShenaoZhang committed on