DeepSeek-V3.1-Terminus-AWQ

by panweiguo - opened Oct 5

Discussion

panweiguo

Oct 5

Could an AWQ-quantized version of DeepSeek-V3.1-Terminus be released? Thank you.

JunHowie

QuantTrio org Oct 7

The results did not meet our expectations: the quantized model tends to generate incoherent output.
Even with the FP16Mix strategy, it still needs close to 8-bit precision to respond properly.
Given this, we have decided not to release DeepSeek-V3.1-Terminus-AWQ.
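
For a rough sense of what "close to 8-bit" implies, here is a minimal sketch of the arithmetic. It assumes (the thread does not define it) that FP16Mix means keeping some fraction of the weights in FP16 while quantizing the rest to 4-bit, and it ignores the overhead of quantization scales and zero points. The function names are hypothetical and only illustrate the calculation.

```python
def average_bits(fp16_fraction: float, low_bits: int = 4, high_bits: int = 16) -> float:
    """Average bits per weight when `fp16_fraction` of weights stay in high precision.

    Assumption: the remaining weights are stored at `low_bits`; scale and
    zero-point overhead is ignored.
    """
    if not 0.0 <= fp16_fraction <= 1.0:
        raise ValueError("fp16_fraction must be in [0, 1]")
    return fp16_fraction * high_bits + (1.0 - fp16_fraction) * low_bits


def fp16_fraction_for(target_bits: float, low_bits: int = 4, high_bits: int = 16) -> float:
    """Fraction of weights that must stay in high precision to reach `target_bits` on average."""
    return (target_bits - low_bits) / (high_bits - low_bits)


if __name__ == "__main__":
    # Under these assumptions, how much of the model must stay in FP16
    # for the average to reach 8 bits per weight?
    print(f"{fp16_fraction_for(8.0):.2%} of weights in FP16")
```

Under these assumptions, reaching an 8-bit average requires about a third of the weights to stay in FP16, at which point a 4-bit release loses most of its size advantage.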

JunHowie changed discussion status to closed Oct 7
