Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

78

Base only

Active filters: gpu

AEON-7/Gemma-4-31B-it-DECKARD-HERETIC-Uncensored-NVFP4

Text Generation • 18B • Updated 18 days ago • 2.53k • 11

danielhanchen/unsloth-blackwell-docker

Updated 2 days ago • 1

AEON-7/Gemma-4-12B-it-AEON-Abliterated-K4-BF16

Text Generation • 12B • Updated 11 days ago • 2.44k • 25

HarmenWessels/gemma-4-12B-it-qat-int4-ov

Image-Text-to-Text • Updated 10 days ago • 517 • 1

Vishal74/Seq2SeqModel_LSTM

Updated Jun 4, 2024

Tech-Meld/gpus-everywhere

Text-to-Image • Updated Jun 26, 2024 • 7 • • 1

vhab10/llama_3.1_8b_Q4_K_M-gguf

Text Generation • 8B • Updated Oct 6, 2024 • 8

frameai/Loxa-4B

Text Generation • 4B • Updated Jan 14, 2025 • 7

mradermacher/Loxa-4B-GGUF

4B • Updated Jan 14, 2025 • 41

mradermacher/Loxa-4B-i1-GGUF

4B • Updated Jan 14, 2025 • 76

frameai/CodeLoxa-4B

Text Generation • 4B • Updated Jan 14, 2025 • 7

mradermacher/CodeLoxa-4B-GGUF

4B • Updated Jan 15, 2025 • 50 • 1

mradermacher/CodeLoxa-4B-i1-GGUF

4B • Updated Jan 15, 2025 • 135

frameai/Loxa-1.6B

Text Generation • 2B • Updated Jan 16, 2025 • 5 • 1

mradermacher/Loxa-1.6B-GGUF

2B • Updated Jan 16, 2025 • 29

mradermacher/Loxa-1.6B-i1-GGUF

2B • Updated Jan 16, 2025 • 194

frameai/Loxa-1.6B-uncensored

Text Generation • 2B • Updated Feb 2, 2025 • 3 • 1

mradermacher/Loxa-1.6B-uncensored-GGUF

2B • Updated Feb 3, 2025 • 62 • 2

mradermacher/Loxa-1.6B-uncensored-i1-GGUF

2B • Updated Feb 3, 2025 • 130

rhinosaur0/rapid3dgs

Updated Mar 22, 2025 • 1

ConfidentialMind/gte-multilingual-reranker-base-onnx-op14-opt-gpu-int8

Sentence Similarity • Updated Jul 7, 2025 • 8 • 1

ConfidentialMind/gte-multilingual-reranker-base-onnx-op14-opt-gpu

Sentence Similarity • Updated Jul 7, 2025 • 4

ConfidentialMind/gte-multilingual-reranker-base-onnx-op19-opt-gpu

Sentence Similarity • Updated Jul 7, 2025 • 11

langutang/protege-lg

Robotics • Updated Apr 26, 2025

sbeierle/fame-pytorch-kit

Updated Apr 28, 2025

excribe/classifer_sgd_longformer_4099

Text Classification • 0.1B • Updated May 6, 2025 • 4

lilbablo/humigencev2

Text Generation • Updated Oct 1, 2025

AhmedAyman/k2-think-cuda-1505

Text Generation • Updated Oct 26, 2025 • 2

Eltamuan/Gravitas-Torch-2.8-Blackwell-Edition

Updated Nov 3, 2025

magiccodingman/Qwen3-4B-Instruct-2507-MXFP4-Hybrid-GGUF

Text Generation • 4B • Updated Dec 3, 2025 • 83