NVFP4 quantization?

#2
by maleal - opened

Hey, thanks a lot for this awesome quantized models. I've found AWQ quantization super helpful.

Do you have in roadmap to quantize in NVFP4? I'm interested in Qwen3 VL and I see nvfp4 has very little loss of accuracy.

Sign up or log in to comment