---
license: apache-2.0
tags:
- unsloth
- trl
- sft
- cot
- reasoning
- think
base_model:
- Qwen/Qwen3-4B
datasets:
- open-r1/Mixture-of-Thoughts
- PSM24/gemini-2.5-pro-100x
---
# Model Info: 
A small qwen 3 model trained on 34000 data collected from open-r1/mixture-of-thought.

# Usage: 
- Solve math
- Generate codes
- Thinking