Q8_0 GGUF quantization of Ken3.5-9B (Qwen3.5-9B fine-tuned on Ken instruct data).
llama-server -m Ken3.5-9B-Q8_0.gguf -ngl 99 -c 4096
Chat template
8-bit
Base model