luckeciano/Qwen-2.5-7B-GRPO-NoBaseline-FisherMaskGlobal-1e-12_5894 Text Generation • 8B • Updated Jun 25 • 4
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline-HessianMaskToken-1.0_4859 Text Generation • 8B • Updated Jun 25 • 4
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline-HessianMaskSentence-1e-3_2991 Text Generation • 8B • Updated Jun 25 • 3
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline-HessianMaskToken-1.0_5849 Text Generation • 8B • Updated Jun 25 • 4
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline-HessianMaskSentence-1e-3_3401 Text Generation • 8B • Updated Jun 25 • 4
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline-HessianMaskSentence-1e-3_4992 Text Generation • 8B • Updated Jun 25 • 4
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline-FisherMaskSentence-1e-7-HessianMaskSentence-1e-6_6372 Text Generation • 8B • Updated Jul 4 • 3
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline-FisherMaskSentence-1e-7-HessianMaskSentence-1e-5_1855 Text Generation • 8B • Updated Jul 4 • 3
tensorblock/luckeciano_Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv-GGUF 8B • Updated 16 days ago • 168