rpreite's Collections

Working Generation models

updated Sep 18, 2025

Quantized models for on-prem LLM R&D

  • rpreite/Qwen3-14B-BNB-INT4

    Text Generation • 15B • Updated Aug 27, 2025 • 3

  • rpreite/gemma-3-12b-it-BNB-INT4

    Image-Text-to-Text • Updated Sep 1, 2025 • 1

  • rpreite/Gemma3_GPTQ_W4A16

    13B • Updated Sep 15, 2025 • 1

  • rpreite/Qwen3_GPTQ_W4A16

    15B • Updated Sep 15, 2025 • 1

  • rpreite/Qwen3_BNB_8bit

    15B • Updated Sep 17, 2025 • 1

  • rpreite/Qwen3_AWQ_W4A16

    3B • Updated Feb 17 • 2

  • rpreite/Qwen3_GPTQ_SmoothQuant_W8A8

    15B • Updated Sep 17, 2025 • 1
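
The repository names in this collection appear to encode the quantization method and bit widths. The sketch below pulls those fields out of the names. It assumes the common convention that `W4A16` means 4-bit weights with 16-bit activations, that `INT4` and `8bit` give the weight width only, and that `BNB` stands for bitsandbytes. None of this is stated on the page itself, and the `describe` helper is hypothetical.

```python
import re

# Repository names copied from the collection listing above.
REPOS = [
    "rpreite/Qwen3-14B-BNB-INT4",
    "rpreite/gemma-3-12b-it-BNB-INT4",
    "rpreite/Gemma3_GPTQ_W4A16",
    "rpreite/Qwen3_GPTQ_W4A16",
    "rpreite/Qwen3_BNB_8bit",
    "rpreite/Qwen3_AWQ_W4A16",
    "rpreite/Qwen3_GPTQ_SmoothQuant_W8A8",
]

# Method tags seen in the names (assumed meanings, not confirmed by the page).
METHODS = ("BNB", "GPTQ", "AWQ", "SmoothQuant")


def describe(repo_id):
    """Hypothetical helper: parse method tags and bit widths from a repo name."""
    name = repo_id.split("/", 1)[1]
    tokens = re.split(r"[-_]", name)
    methods = [t for t in tokens if t in METHODS]
    weight_bits = act_bits = None
    for t in tokens:
        # "W4A16" style: weight bits followed by activation bits.
        m = re.fullmatch(r"W(\d+)A(\d+)", t)
        if m:
            weight_bits, act_bits = int(m.group(1)), int(m.group(2))
            continue
        # "INT4" or "8bit" style: weight bits only; activations stay unknown.
        m = re.fullmatch(r"INT(\d+)|(\d+)bit", t)
        if m:
            weight_bits = int(m.group(1) or m.group(2))
    return {
        "repo": repo_id,
        "methods": methods,
        "weight_bits": weight_bits,
        "act_bits": act_bits,
    }


if __name__ == "__main__":
    for repo in REPOS:
        print(describe(repo))
```

Activation width is left as `None` for the `BNB` entries because their names do not state it.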