ISTA-DASLab/Llama-3.2-1B-Instruct-FPQuant-QAT-NVFP4
0.8B
•
Updated
•
27
High-quality QAT FP4 models to use with the fp_quant vLLM/Transformers integration on Blackwell NVIDIA GPUs. See https://arxiv.org/abs/2509.23202