models 133
Muadil/Llama-3.2-1B-Instruct_sum_DPO_140k_1_20ep_deneme • Text Generation • 1B • Updated • 8
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_1k_1_3ep_4bit • Text Generation • 1B • Updated • 5
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_3ep_4bit • Text Generation • 1B • Updated • 6
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_1k_1_2ep_4bit • Text Generation • 1B • Updated • 6
Muadil/Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_2ep_4bit • Text Generation • 1B • Updated • 7
Muadil/Llama-3.2-1B-Instruct_sum_DPO_1k_1_2ep_4bit • Text Generation • 1B • Updated • 7
Muadil/Llama-3.2-1B-Instruct_sum_DPO_1k_1_1ep_4bit • Text Generation • 1B • Updated • 7
Muadil/Llama-3.2-1B-Instruct_sum_DPO_10k_1_1ep_4bit • Text Generation • 1B • Updated • 8
Muadil/Llama-3.2-1B-Instruct_sum_KTO_10k_1_2ep_4bit • Text Generation • 1B • Updated • 4
Muadil/Llama-3.2-1B-Instruct_sum_DPO_10k_1_2ep_4bit • Text Generation • 1B • Updated • 7
datasets 11
Muadil/dpo_formatted_openai_summary • Viewer • Updated • 183k • 20
Muadil/dpo_dataset_train_openai_summary • Viewer • Updated • 176k • 18
Muadil/ppo_datasets_summary • Viewer • Updated • 176k • 42
Muadil/kto_labeled_openai_summary • Viewer • Updated • 365k • 29 • 1
Muadil/cleaned_openai_summary_comparisons • Viewer • Updated • 183k • 30
Muadil/all_cleaned_openai_summarize_comparisons_train_val • Viewer • Updated • 176k • 41
Muadil/all_unique_cleaned_openai_summarize_comparisons_test • Viewer • Updated • 6.24k • 21
Muadil/old_all_cleaned_openai_summarize_comparisons_test • Viewer • Updated • 6.24k • 31
Muadil/old_all_cleaned_openai_summarize_comparisons_train_val • Viewer • Updated • 176k • 26
Muadil/old_all_unique_cleaned_openai_summarize_comparisons • Viewer • Updated • 21k • 28