s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response-grouped-correct-only-mean-token • Text Generation • 8B • Updated Apr 19 • 6
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response-grouped-mean-token • Text Generation • 8B • Updated Apr 19 • 4
luckeciano/Qwen-2.5-7B-RL-AC-BigLRv3-Fast-4-v4-Train-ConstLR • Text Generation • 8B • Updated Apr 18 • 3
luckeciano/Qwen-2.5-7B-RL-AC-BigLRv3-Fast-4-v5-Train-NoKL-Marg • Text Generation • 8B • Updated Apr 19 • 3
luckeciano/Qwen-2.5-7B-RL-AC-BigLRv3-Fast-4-v5-Train-NoKL-Marg-NormAdv • Text Generation • 8B • Updated Apr 20 • 3
chenggong1995/Qwen2.5-Math-7B-gen8-math3to5_olympiads_aime-grpo-epoch1 • Text Generation • 8B • Updated Apr 24 • 5