FP-Quant QAT

updated 16 days ago

High-quality FP4 models produced with quantization-aware training (QAT), for use with the fp_quant vLLM/Transformers integration on NVIDIA Blackwell GPUs. See https://arxiv.org/abs/2509.23202
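
As a rough illustration of the Transformers path mentioned above, the sketch below loads one checkpoint from this collection through the standard `AutoModelForCausalLM` / `AutoTokenizer` API. It assumes that the quantization settings are read from the checkpoint's own config, that the fp_quant integration and `accelerate` are installed, and that a Blackwell GPU is available; none of this is confirmed by the collection page itself. The helper names `load_fp4_model` and `generate` are hypothetical.

```python
# Minimal sketch, not the authors' documented recipe.
# Assumption: the checkpoint's config carries its own quantization settings,
# so no explicit quantization config is passed here.
REPO_ID = "ISTA-DASLab/Qwen3-0.6B-FPQuant-QAT-NVFP4"  # taken from the collection listing


def load_fp4_model(repo_id: str = REPO_ID):
    """Hypothetical helper: return (tokenizer, model) for a repo in this collection."""
    # Imported lazily so that defining the helper needs no GPU or heavy dependencies.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    # device_map="auto" relies on the `accelerate` package being installed.
    model = AutoModelForCausalLM.from_pretrained(repo_id, device_map="auto")
    return tokenizer, model


def generate(prompt: str, max_new_tokens: int = 32, repo_id: str = REPO_ID) -> str:
    """Hypothetical helper: greedy generation for a single prompt."""
    tokenizer, model = load_fp4_model(repo_id)
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Hello")` downloads the checkpoint on first use; any other repo ID from the list can be passed as `repo_id`.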

  • ISTA-DASLab/Llama-3.2-1B-Instruct-FPQuant-QAT-NVFP4

    0.8B • Updated 5 days ago • 27

  • ISTA-DASLab/Llama-3.2-1B-Instruct-FPQuant-QAT-MXFP4

    0.8B • Updated 5 days ago • 22

  • ISTA-DASLab/Llama-3.2-3B-Instruct-FPQuant-QAT-NVFP4

    2B • Updated 5 days ago • 28

  • ISTA-DASLab/Llama-3.2-3B-Instruct-FPQuant-QAT-MXFP4

    2B • Updated 5 days ago • 27

  • ISTA-DASLab/Llama-3.1-8B-Instruct-FPQuant-QAT-NVFP4

    5B • Updated 5 days ago • 43

  • ISTA-DASLab/Llama-3.1-8B-Instruct-FPQuant-QAT-MXFP4

    5B • Updated 5 days ago • 38

  • ISTA-DASLab/Qwen3-0.6B-FPQuant-QAT-NVFP4

    Text Generation • 0.4B • Updated 16 days ago • 27

  • ISTA-DASLab/Qwen3-1.7B-FPQuant-QAT-NVFP4

    Text Generation • 1B • Updated 16 days ago • 16

  • ISTA-DASLab/Qwen3-4B-FPQuant-QAT-NVFP4

    Text Generation • 2B • Updated 16 days ago • 20

  • ISTA-DASLab/Qwen3-8B-FPQuant-QAT-NVFP4

    5B • Updated 5 days ago • 31

  • ISTA-DASLab/Qwen3-8B-FPQuant-QAT-MXFP4

    5B • Updated 5 days ago • 58
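
The repository names above follow a visible pattern: base model name, then `FPQuant-QAT`, then the FP4 format (`NVFP4` or `MXFP4`). The sketch below, a hypothetical helper rather than anything shipped with the collection, parses that pattern and groups the listed repositories by base model, which makes it easy to see which formats exist for each one. The regular expression is inferred from the names on this page only.

```python
import re
from collections import defaultdict

# Repo IDs copied from the collection listing.
REPOS = [
    "ISTA-DASLab/Llama-3.2-1B-Instruct-FPQuant-QAT-NVFP4",
    "ISTA-DASLab/Llama-3.2-1B-Instruct-FPQuant-QAT-MXFP4",
    "ISTA-DASLab/Llama-3.2-3B-Instruct-FPQuant-QAT-NVFP4",
    "ISTA-DASLab/Llama-3.2-3B-Instruct-FPQuant-QAT-MXFP4",
    "ISTA-DASLab/Llama-3.1-8B-Instruct-FPQuant-QAT-NVFP4",
    "ISTA-DASLab/Llama-3.1-8B-Instruct-FPQuant-QAT-MXFP4",
    "ISTA-DASLab/Qwen3-0.6B-FPQuant-QAT-NVFP4",
    "ISTA-DASLab/Qwen3-1.7B-FPQuant-QAT-NVFP4",
    "ISTA-DASLab/Qwen3-4B-FPQuant-QAT-NVFP4",
    "ISTA-DASLab/Qwen3-8B-FPQuant-QAT-NVFP4",
    "ISTA-DASLab/Qwen3-8B-FPQuant-QAT-MXFP4",
]

# Assumed naming pattern, inferred from the listing above.
_PATTERN = re.compile(r"^ISTA-DASLab/(?P<base>.+)-FPQuant-QAT-(?P<fmt>NVFP4|MXFP4)$")


def parse_repo(repo_id: str) -> tuple[str, str]:
    """Split a repo ID into (base model name, FP4 format)."""
    match = _PATTERN.match(repo_id)
    if match is None:
        raise ValueError(f"unexpected repo name: {repo_id}")
    return match["base"], match["fmt"]


def formats_by_base(repos: list[str]) -> dict[str, list[str]]:
    """Map each base model to the FP4 formats listed for it, in listing order."""
    grouped: dict[str, list[str]] = defaultdict(list)
    for repo_id in repos:
        base, fmt = parse_repo(repo_id)
        grouped[base].append(fmt)
    return dict(grouped)


if __name__ == "__main__":
    for base, fmts in formats_by_base(REPOS).items():
        print(f"{base}: {', '.join(fmts)}")
```

In this listing, the three smaller Qwen3 checkpoints appear only in NVFP4, while every Llama checkpoint and Qwen3-8B appear in both formats.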