GGUF for Ollama

by SwimTreeWire - opened Sep 2

Sep 2

I would like to use this with ollama. How can i make the GGUF from this repo?

Sep 2

This is a new architecture and support hasn't been merged into Llama.cpp yet.

Sep 2

how can this be achieved? can i somehow make a GGUF myself and upload?

Sep 2

Nope, that means someone has to write the support for the model in the backend itself. You can probably sub to https://github.com/ggml-org/llama.cpp/issues/15748 to get updates.

pd95

Oct 3

Changes have been merged to llama.cpp and are hopefully coming to Ollama 🥳

I've been experimenting today and wrote about how I experimentally run Apertus in Ollama on my Mac here: https://gist.github.com/pd95/7841bb5d15220773c4ca8666f024c7c9

mjaggi

Swiss AI Initiative org Oct 8

those will be working in ollama as well soon (but ollama has to first update to use the most recent llama.cpp code).

Oct 10

awesome!

Oct 21

loleg

Oct 21

Confirmed, thanks. See my blog post for more details & instructions https://log.alets.ch/110/#using-ollama

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment