ggml-org/Qwen3-Reranker-0.6B-Q8_0-GGUF

This model was converted to GGUF format from Qwen/Qwen3-Reranker-0.6B using llama.cpp, via ggml.ai's GGUF-my-repo space. Refer to the original model card for more details on the model.

Downloads last month: 207
Format: GGUF
Model size: 0.6B params
Architecture: qwen3

Quantization: 8-bit (Q8_0)
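
As a rough illustration of what 8-bit quantization implies for file size, the sketch below estimates the on-disk footprint from the nominal parameter count. It assumes the Q8_0 block layout used by llama.cpp (32 int8 weights plus one fp16 scale per block, i.e. 34 bytes per 32 weights) and assumes every tensor is stored as Q8_0. The function names are hypothetical, and the actual file will differ somewhat because the true parameter count is not exactly 0.6B, some tensors may be kept in higher precision, and the file also carries metadata.

```python
# Back-of-the-envelope size estimate for a Q8_0 GGUF file.
# Assumption: every weight is stored in Q8_0 blocks (not true for all tensors).

Q8_0_BLOCK_WEIGHTS = 32          # weights per quantization block
Q8_0_BLOCK_BYTES = 32 + 2        # 32 int8 values + one fp16 scale


def q8_0_bits_per_weight() -> float:
    """Effective storage cost per weight, including the per-block scale."""
    return Q8_0_BLOCK_BYTES * 8 / Q8_0_BLOCK_WEIGHTS


def estimate_q8_0_size_bytes(n_params: float) -> float:
    """Approximate tensor-data size in bytes for n_params weights."""
    return n_params * Q8_0_BLOCK_BYTES / Q8_0_BLOCK_WEIGHTS


# Nominal count taken from the "0.6B params" figure on this page.
NOMINAL_PARAMS = 0.6e9

size = estimate_q8_0_size_bytes(NOMINAL_PARAMS)
print(f"{q8_0_bits_per_weight()} bits per weight")
print(f"approx. {size / 1e6:.1f} MB of tensor data")
```

The estimate is only meant to show why a Q8_0 file is slightly larger than one byte per parameter; check the repository's file listing for the real size.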

Model tree for ggml-org/Qwen3-Reranker-0.6B-Q8_0-GGUF

Quantized (40): this model