rl-rag/qwen3-4b-it-combined-sft-training-data-v20250824_MiroSystemPrompt • Text Generation • 4B • Updated Sep 2 • 15