What speed do you get at Q8 on AMD Ryzen™ AI Max+ 395
11
#14 opened 6 months ago
by
akierum
Can we create a ..."GLM-4.6-Distill-GLM-4.5-Air-GGUF"?
3
#13 opened 7 months ago
by
NKLAR5
model has unused tensor on UD-IQ2_M: Is it normal?
👍 2
#12 opened 7 months ago
by
engrtipusultan
Corrected jinja template with tool Support works with PR llama.cpp/pull/15186
❤️ 2
16
#9 opened 8 months ago
by
xbruce22
Fixed 🏆 GLM Tool calling support in llama.cpp, raised PR
👀 1
4
#8 opened 8 months ago
by
xbruce22
Smashed 💪 Scored to 82.86 🔥2bit IQ2_M on MMLU Pro single shot benchmark
🔥❤️ 2
5
#7 opened 8 months ago
by
xbruce22
Scored 72.86 2bit IQ2_M on MMLU Pro single shot (reasoning enabled)
❤️🔥 1
1
#6 opened 8 months ago
by
xbruce22
Error in ollama
👍 1
#5 opened 9 months ago
by
Sam1989
llama.cpp\src\llama-kv-cache-unified.cpp:226: GGML_ASSERT(seq_id >= 0 && (size_t) seq_id < seq_to_stream.size()) failed
2
#4 opened 9 months ago
by
devold
Missing GLM-4.5-Air-UD-IQ3_XXS.gguf
➕ 3
#3 opened 9 months ago
by
BVEsun
unused tensors?
➕ 1
2
#2 opened 9 months ago
by
jacek2024
Tool Calls Not Working with –jinja Option
11
#1 opened 9 months ago
by
TNohSam