Active filters: sparsity
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic
Text Generation • 8B • Updated • 1
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic
Text Generation • 8B • Updated • 4 • 1
RedHatAI/Sparse-Llama-3.1-8B-2of4
Text Generation • 8B • Updated • 67 • 62
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16
Text Generation • 2B • Updated • 2 • 3
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4
Text Generation • 8B • Updated • 5 • 1
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4
Text Generation • 8B • Updated • 1 • 1
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-quantized.w4a16
Text Generation • 2B • Updated • 2
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-quantized.w4a16
Text Generation • 2B • Updated • 2
bartowski/Sparse-Llama-3.1-8B-2of4-GGUF
Text Generation • 8B • Updated • 1.09k • 3
QuantFactory/Sparse-Llama-3.1-8B-2of4-GGUF
Text Generation • 8B • Updated • 192 • 4
tensorblock/Sparse-Llama-3.1-8B-2of4-GGUF
Text Generation • 8B • Updated • 97
nintwentydo/pixtral-12b-2409-2of4-sparse
Image-Text-to-Text • 13B • Updated • 1
HangGuo/Llama2-70B-QuaRot-OBR-GPTQ-W4A4KV4S50
Text Generation • Updated • 23 • 1
HangGuo/Llama2-70B-QuaRot-OBR-RTN-W4A4KV4S50
Text Generation • Updated • 12
HangGuo/Llama2-70B-SpinQuant-OBR-RTN-W4A4KV4S50
Text Generation • Updated • 3
HangGuo/Llama2-70B-SpinQuant-OBR-GPTQ-W4A4KV4S50
Text Generation • Updated • 3
HangGuo/Llama3-70B-SpinQuant-OBR-GPTQ-W4A4KV4S50
Text Generation • Updated • 3
HangGuo/Llama3-70B-SpinQuant-OBR-RTN-W4A4KV4S50
Text Generation • Updated • 4
HangGuo/Llama3-70B-QuaRot-OBR-RTN-W4A4KV16S50
Text Generation • Updated • 11
HangGuo/Llama3-70B-QuaRot-OBR-GPTQ-W4A4KV16S50
Text Generation • Updated • 11
HangGuo/QWen2.5-7B-FlatQuant-OBR-GPTQ-W4A4KV4S50
Text Generation • Updated • 9
HangGuo/QWen2.5-32B-FlatQuant-OBR-GPTQ-W4A4KV4S50
Text Generation • Updated • 8
HangGuo/QWen2.5-1.5B-FlatQuant-OBR-GPTQ-W4A8KV16S50
Text Generation • Updated • 6
HangGuo/QWen2.5-3B-FlatQuant-OBR-GPTQ-W4A8KV16S50
Text Generation • Updated • 14
HangGuo/QWen2.5-3B-FlatQuant-OBR-GPTQ-W4A4KV4S50
Text Generation • Updated • 10
HangGuo/QWen2.5-1.5B-FlatQuant-OBR-GPTQ-W4A4KV4S50
Text Generation • Updated • 3