Unable to load Q5_K_M model in Ollama 0.12.5 - Error 500: unable to load model blob

by acrosley - opened 20 days ago

20 days ago

Getting a persistent 500 Internal Server Error: unable to load model when attempting to run the Q5_K_M quantization of this model with Ollama.

Pull the model: ollama pull hf.co/yairpatch/Qwen3-VL-30B-A3B-Instruct-GGUF:Q5_K_M
Model downloads successfully (all blobs at 100%)
Run: ollama run hf.co/yairpatch/Qwen3-VL-30B-A3B-Instruct-GGUF:Q5_K_M

Model should load and start an interactive chat session.

Error: 500 Internal Server Error: unable to load model: D:\OllamaModels\blobs\sha256-a3dcf99539e09f8a9f5578508bc0b834f62b0bd85e4764d56e942a9d89def85b

Any guidance would be appreciated!

ps- ty cursor for writing this

yair patch org 20 days ago

It is not compatible with Ollama at this point.

16 days ago

How to run it locally?

16 days ago

I couldn't figure out how to run it locally and decided to just use 8B-instruct which is actually still really good

intlex

15 days ago

llama.cpp-tr-qwen3-vl-3-b6981-ab45b1a supports conversion to GGUF format and can be tested using llama-mtmd-cli.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment