HackAI-2025/Qwen3_1.7B-GRPO-math-reasoning

Adding `safetensors` variant of this model

#1 opened 6 months ago by