Inference Providers
Active filters: fp4
nvidia/Qwen3.6-35B-A3B-NVFP4
Text Generation
• 19B • Updated • 470k
• 149
Text Generation
• 382B • Updated • 18.8k
• 31
nvidia/Qwen3.5-122B-A10B-NVFP4
Text Generation
• 65B • Updated • 12
• 5
Text Generation
• Updated • 758k
• 29
AEON-7/Step-3.7-Flash-AEON-Ultimate-Abliterated-NVFP4
Image-Text-to-Text
• 104B • Updated • 485
• 4
RedHatAI/gemma-4-31B-it-NVFP4
Image-Text-to-Text
• 20B • Updated • 184k
• 47
AEON-7/Qwen3.6-35B-A3B-heretic-NVFP4
Image-Text-to-Text
• 21B • Updated • 91.2k
• 45
sakamakismile/LFM2.5-8B-A1B-NVFP4
Text Generation
• 5B • Updated • 582
• 3
tonera/FLUX.2-klein-9B-Nunchaku
Image-to-Image
• Updated • 1.93k
• 15
ussoewwin/Hybrid-Sensitivity-Weighted-Quantization-SDXL-fp8e4m3
Text-to-Image
• Updated • 7
tonera/waiNSFWIllustrious_v150
Text-to-Image
• Updated • 63
• 2
nvidia/Qwen3.5-397B-A17B-NVFP4
Text Generation
• Updated • 851k
• 98
OptimizeLLM/Qwen3.5-122B-A10B-heretic-MTP-NVFP4
Text Generation
• 74B • Updated • 4.76k
• 4
AEON-7/Gemma-4-31B-it-DECKARD-HERETIC-Uncensored-NVFP4-SVDQuant
Text Generation
• 19B • Updated • 668
• 2
FreedomAISVR/Qwen3.6-35B-A3B-NVFP4-GGUF
Image-Text-to-Text
• 35B • Updated • 984
• 3
OpenYourMind/Qwopus3.5-122B-A10B-Kimi-K2.6-destilled-abliterated-NVFP4
Image-Text-to-Text
• 74B • Updated • 1.3k
• 3
crushleorey/Qwopus3.6-27B-v2-NVFP4
Image-Text-to-Text
• 15B • Updated • 6.4k
• 3
TentaFlow/Bielik-1.5B-NVFP4
Text Generation
• 0.9B • Updated • 14
• 1
Text Generation
• 18B • Updated • 34
• 2
mengqin1/RedidreamNSFWI1-bnb-4bit
Text Generation
• 19B • Updated • 48
• 3
qingcheng-ai/Qwen3-32B-fp4
Text Generation
• 19B • Updated • 77
• 4
qingcheng-ai/Qwen3-8B-fp4
Text Generation
• 5B • Updated • 4
• 1
RedHatAI/Qwen3-30B-A3B-NVFP4
Text Generation
• 17B • Updated • 73.8k
• 2
RedHatAI/Llama-3.1-70B-Instruct-NVFP4
Text Generation
• 41B • Updated • 435
RedHatAI/Llama-3.1-70B-Instruct-NVFP4A16
Text Generation
• 41B • Updated • 4
Text Generation
• 19B • Updated • 12.9k
• 8
RedHatAI/Qwen3-32B-NVFP4A16
Text Generation
• 19B • Updated • 82
• 2
nvidia/Qwen3-235B-A22B-NVFP4
Text Generation
• 133B • Updated • 42.7k
• 18
nvidia/Qwen3-30B-A3B-NVFP4
Text Generation
• 16B • Updated • 51.7k
• 31