s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response-grouped Text Generation • Updated Apr 18 • 7
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response-grouped-correct-only-mean-token Text Generation • Updated Apr 19 • 10
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response-grouped-mean-token Text Generation • Updated Apr 19 • 5
luckeciano/Qwen-2.5-7B-RL-AC-BigLRv3-Fast-4-v5-Train-NoKL-Marg-NormAdv Text Generation • Updated Apr 20 • 4
chenggong1995/Qwen2.5-Math-7B-gen8-math3to5_olympiads_aime-grpo-epoch1 Text Generation • Updated 30 days ago • 1