GGUF llama.cpp quantized version of:
- Original model: Qwen3-4B-Instruct-2507
- Model creator: Qwen
- License
Recommended Prompt Format (ChatML)
<|im_start|>system
Provide some context and/or instructions to the model.<|im_end|>
<|im_start|>user
The user’s message goes here<|im_end|>
<|im_start|>assistant
AI message goes here<|im_end|>
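
The template above can be assembled programmatically. The following is a minimal sketch in Python, assuming a single system message and a single user message. The helper name `build_chatml_prompt` is hypothetical and not part of llama.cpp or any library. The prompt ends with an open assistant turn so that the model generates the reply and then emits `<|im_end|>` itself.

```python
def build_chatml_prompt(system: str, user: str) -> str:
    """Format one system and one user message as a ChatML prompt.

    Hypothetical helper for illustration; the layout mirrors the
    recommended prompt format shown in this README.
    """
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        # Leave the assistant turn open so the model writes the reply.
        "<|im_start|>assistant\n"
    )


if __name__ == "__main__":
    print(build_chatml_prompt("You are a helpful assistant.", "Hello!"))
```

Many llama.cpp front ends can apply the chat template stored in the GGUF metadata automatically, so manual formatting like this is mainly useful when sending raw completion prompts.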
Available quantizations: 4-bit, 5-bit, 8-bit