BKM1804/Qwen2.5-3B-d65684e0-356b-40dd-a7ac-fc187b29b01a-DPO_bs32-checkpoint Updated about 1 month ago
nate-rahn/0730-rm_filter_judgment_data-qwen25_3b-hf Text Classification • 3B • Updated 25 days ago • 9
RylanSchaeffer/mem_model_Qwen2.5-3B_dataset_minerva_math_epochs_32_seed_0 Text Generation • 3B • Updated 11 days ago • 72
RylanSchaeffer/mem_model_Qwen2.5-3B_dataset_minerva_math_epochs_100_seed_0 Text Generation • 3B • Updated 10 days ago • 57