ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g128-BitBLAS Text Generation • 20B • Updated Jul 22, 2024
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128-BitBLAS Text Generation • 5B • Updated Jul 22, 2024 • 15
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64-BitBLAS Text Generation • 3B • Updated Jul 22, 2024 • 1
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128-BitBLAS Text Generation • 3B • Updated Jul 22, 2024 • 1
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w4g128-BitBLAS Text Generation • 37B • Updated Jul 22, 2024 • 3
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g64-BitBLAS Text Generation • 21B • Updated Jul 22, 2024 • 10
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128-GPTQ Text Generation • 2B • Updated Jul 22, 2024 • 5 • 1
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w4g128-GPTQ Text Generation • 11B • Updated Jul 22, 2024 • 1
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g128-GPTQ Text Generation • 7B • Updated Jul 22, 2024 • 11