anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_126 Text Generation • Updated May 6 • 5
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_140 Text Generation • Updated May 6 • 6
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_160 Text Generation • Updated May 6 • 5
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_170 Text Generation • Updated May 6 • 5
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_190 Text Generation • Updated May 6 • 9
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_210 Text Generation • Updated May 6 • 7
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_230 Text Generation • Updated May 6 • 5
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_250 Text Generation • Updated May 6 • 5
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_260 Text Generation • Updated May 6 • 7
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_280 Text Generation • Updated May 6 • 5
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_294 Text Generation • Updated May 6 • 5
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_310 Text Generation • Updated May 6 • 7
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_330 Text Generation • Updated May 6 • 5
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_340 Text Generation • Updated May 6 • 5
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_360 Text Generation • Updated May 6 • 5
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_378 Text Generation • Updated May 6 • 7
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_390 Text Generation • Updated May 6 • 5
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__countdown_deepseek_qwen_distilled_32b_dataset_epoch_410 Text Generation • Updated May 6 • 7
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__math_dataset_based_on_gt_reasoning_trace_epoch_10 Text Generation • Updated May 6 • 8
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__math_dataset_based_on_gt_reasoning_trace_epoch_30 Text Generation • Updated May 6 • 15
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__math_dataset_based_on_gt_reasoning_trace_epoch_50 Text Generation • Updated May 6 • 12
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__math_dataset_based_on_gt_reasoning_trace_epoch_60 Text Generation • Updated May 6 • 8
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__math_dataset_based_on_gt_reasoning_trace_epoch_80 Text Generation • Updated May 6 • 13
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__math_dataset_based_on_gt_reasoning_trace_epoch_100 Text Generation • Updated May 6 • 7
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__math_dataset_based_on_gt_reasoning_trace_epoch_112 Text Generation • Updated May 6 • 13
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__math_dataset_based_on_gt_reasoning_trace_epoch_130 Text Generation • Updated May 6 • 7
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__math_dataset_based_on_gt_reasoning_trace_epoch_150 Text Generation • Updated May 6 • 16
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__math_dataset_based_on_gt_reasoning_trace_epoch_168 Text Generation • Updated May 6 • 5
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__math_dataset_based_on_gt_reasoning_trace_epoch_180 Text Generation • Updated May 6 • 20
anmolagarwal999/Qwen2.5-0.5B-Instruct__sft_saved__math_dataset_based_on_gt_reasoning_trace_epoch_200 Text Generation • Updated May 6 • 9