--- license: apache-2.0 tags: - unsloth - trl - sft - cot - reasoning - think base_model: - Qwen/Qwen3-4B datasets: - open-r1/Mixture-of-Thoughts - PSM24/gemini-2.5-pro-100x --- # Model Info: A small qwen 3 model trained on 34000 data collected from open-r1/mixture-of-thought. # Usage: - Solve math - Generate codes - Thinking