-
-
-
-
-
-
Inference Providers
Active filters:
4-bit
mlx-community/MiniMax-M2-4bit
Text Generation
•
229B
•
Updated
•
941
•
7
dousery/medical-reasoning-gpt-oss-20b
Text Generation
•
21B
•
Updated
•
2.21k
•
43
mlx-community/GLM-4.6-4bit
Text Generation
•
353B
•
Updated
•
4.11k
•
11
mlx-community/chandra-4bit
Image-to-Text
•
Updated
•
131
•
4
Jalea96/DeepSeek-OCR-bnb-4bit-NF4
Image-Text-to-Text
•
3B
•
Updated
•
847
•
4
MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF
Text Generation
•
7B
•
Updated
•
100k
•
126
nightmedia/VCoder-120b-1.0-qx86-hi-mlx
Text Generation
•
117B
•
Updated
•
95
•
3
QuantTrio/MiniMax-M2-AWQ
Text Generation
•
Updated
•
166
•
3
TheBloke/MythoMax-L2-13B-GPTQ
Text Generation
•
2B
•
Updated
•
757
•
215
unsloth/DeepSeek-R1-Distill-Qwen-32B-bnb-4bit
Text Generation
•
18B
•
Updated
•
357k
•
27
gaunernst/gemma-3-27b-it-int4-awq
Image-Text-to-Text
•
6B
•
Updated
•
52k
•
33
Qwen/Qwen3-32B-AWQ
Text Generation
•
6B
•
Updated
•
509k
•
113
Qwen/Qwen3-14B-AWQ
Text Generation
•
3B
•
Updated
•
136k
•
39
Qwen/Qwen3-235B-A22B-GPTQ-Int4
Text Generation
•
Updated
•
58.2k
•
25
unsloth/Qwen3-Next-80B-A3B-Instruct-bnb-4bit
Text Generation
•
Updated
•
43.7k
•
16
Intel/Qwen3-VL-235B-A22B-Instruct-int4-AutoRound
2B
•
Updated
•
77
•
2
unsloth/Qwen3-VL-8B-Instruct-unsloth-bnb-4bit
Image-Text-to-Text
•
9B
•
Updated
•
22.2k
•
7
Edison2525/Qwen3-8B-AWQ
8B
•
Updated
•
66
•
2
QuantTrio/Qwen3-VL-32B-Instruct-AWQ
Image-Text-to-Text
•
33B
•
Updated
•
1.01k
•
2
mlx-community/LLaDA2.0-flash-preview-4bit
Text Generation
•
103B
•
Updated
•
31
•
2
mlx-community/DeepSeek-OCR-4bit
Image-Text-to-Text
•
0.8B
•
Updated
•
1.12k
•
2
TheBloke/Wizard-Vicuna-13B-Uncensored-GPTQ
Text Generation
•
2B
•
Updated
•
308
•
320
TheBloke/Phind-CodeLlama-34B-v2-GPTQ
Text Generation
•
5B
•
Updated
•
18
•
90
TheBloke/leo-hessianai-13B-chat-AWQ
Text Generation
•
2B
•
Updated
•
29
•
1
TheBloke/Psyfighter-13B-GPTQ
Text Generation
•
2B
•
Updated
•
9
•
7
TheBloke/Mistral-7B-Instruct-v0.2-AWQ
Text Generation
•
1B
•
Updated
•
56k
•
51
MaziyarPanahi/Mistral-7B-Instruct-Aya-101-GGUF
Text Generation
•
7B
•
Updated
•
326
•
10
macadeliccc/Hermes-2-Pro-Mistral-7B-AWQ
Text Generation
•
1B
•
Updated
•
4
•
1
hugging-quants/Meta-Llama-3.1-405B-Instruct-AWQ-INT4
Text Generation
•
59B
•
Updated
•
494
•
37
unsloth/gemma-2-2b-it-bnb-4bit
Text Generation
•
2B
•
Updated
•
7.23k
•
20