oceanpty
's Collections
oceanpty/TOA-Ultrafeedback-SFT-Rand-lla3.1-8b-inst
Viewer
•
Updated
•
59.9k
•
33
oceanpty/TOA-Ultrafeedback-SFT-Rand-qwen2-7b-inst
Viewer
•
Updated
•
59.9k
•
47
oceanpty/TOA-Ultrafeedback-SFT-PRS-lla3.1-8b-inst
Viewer
•
Updated
•
59.9k
•
54
oceanpty/TOA-Ultrafeedback-SFT-PRS-qwen2-7b-inst
Viewer
•
Updated
•
59.9k
•
29
oceanpty/TOA-Ultrafeedback-SFT-Ensemble-model-num-4
Viewer
•
Updated
•
59.9k
•
30
oceanpty/TOA-Ultrafeedback-SFT-SeqRefine-model-num-4
Viewer
•
Updated
•
59.9k
•
39
oceanpty/TOA-Ultrafeedback-SFT-MoA-model-num-4
Viewer
•
Updated
•
59.4k
•
52
oceanpty/TOA-Ultrafeedback-SFT-TOA-model-num-4
Viewer
•
Updated
•
59.8k
•
39
oceanpty/TOA-Ultrafeedback-DPO-TOA-model-num-4
Viewer
•
Updated
•
57.1k
•
37
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-Rand-lla31-8b-inst
8B
•
Updated
•
9
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-PRS-lla31-8b-inst
8B
•
Updated
•
4
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-small-scale-ensemble
8B
•
Updated
•
2
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-small-scale-SeqRefine
8B
•
Updated
•
6
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-small-scale-MoA
8B
•
Updated
•
12
oceanpty/TOA-ultrafeedback-lla3-8b-inst-sft-data-small-scale-TOA
8B
•
Updated
•
8
oceanpty/TOA-ultrafeedback-lla3-8b-inst-dpo-data-small-scale-mcts-n-40-pi-0-ni-30
8B
•
Updated
•
7