Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
18.4
TFLOPS
13
1
57
Ruslan
uzvisa
Follow
cnrkaya's profile picture
Hastagaras's profile picture
gaytri1111's profile picture
8 followers
·
85 following
AI & ML interests
None yet
Recent Activity
liked
a model
3 days ago
baidu/Unlimited-OCR
new
activity
about 2 months ago
Qwen/Qwen3.6-35B-A3B:
how to enable non-thinking mode of this model in llama.cpp?
reacted
to
eaddario
's
post
with 👍
about 2 months ago
Experimental global target bits‑per‑weight quantization of Qwen/Qwen3.6-27B and Qwen/Qwen3.6-35B-A3B. Unlike standard llama.cpp quantizations that rely on fixed type heuristics (e.g., Q4_K_M), the Target BPW approach optimizes per-tensor precision where it matters the most, and produces high quality models that meet a precise global file size target. Key Advantages: - VRAM Maximization: Can generate high quality models sized exactly to fit hardware constraints (e.g., fitting the model into exactly 24GB VRAM). - Data-Driven Precision: Quantization mix is determined by actual weight error sensitivity rather than hardcoded rules, often yielding better PPL/KLD size trade-offs. Full benchmarks (PPL, KLD, ARC, GPQA, MMLU, etc.) and methodology in the models' cards. https://huggingface.co/eaddario/Qwen3.6-27B-GGUF https://huggingface.co/eaddario/Qwen3.6-35B-A3B-GGUF
View all activity
Organizations
None yet
uzvisa
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
3 days ago
baidu/Unlimited-OCR
Image-Text-to-Text
•
3B
•
Updated
3 minutes ago
•
213k
•
1.15k
liked
a model
3 months ago
Tesslate/OmniCoder-9B
Text Generation
•
9B
•
Updated
Mar 13
•
3.88k
•
650
liked
18 models
4 months ago
steampunque/Qwen3-VL-8B-Instruct-MP-GGUF
8B
•
Updated
Feb 18
•
142
•
2
steampunque/gemma-3-12b-it-MP-GGUF
12B
•
Updated
Feb 18
•
30
•
1
steampunque/Ministral-3-8B-Instruct-2512-MP-GGUF
8B
•
Updated
Feb 18
•
11
•
1
steampunque/Qwen2.5-Coder-14B-Instruct-MP-GGUF
15B
•
Updated
Feb 18
•
16
•
1
tencent/HY-MT1.5-7B-GGUF
Translation
•
8B
•
Updated
Jan 7
•
988
•
56
allura-forge/Llama-3.3-8B-Instruct
8B
•
Updated
Dec 31, 2025
•
694
•
206
mradermacher/Nanbeige-4.1-Python-DeepThink-3B-GGUF
4B
•
Updated
Feb 18
•
132
•
3
deltakitsune/Nanbeige-4.1-Python-DeepThink-3B
Text Generation
•
4B
•
Updated
Feb 16
•
559
•
7
TheDrummer/Tiger-Gemma-12B-v3-GGUF
13B
•
Updated
Jul 9, 2025
•
1.44k
•
15
MuXodious/Nanbeige4.1-3B-PaperWitch-heresy
Text Generation
•
4B
•
Updated
Feb 19
•
7
•
4
gabriellarson/WEBGEN-4B-Preview-GGUF
Text Generation
•
4B
•
Updated
Sep 2, 2025
•
308
•
20
TheDrummer/Rocinante-X-12B-v1
12B
•
Updated
Jan 25
•
1.41k
•
85
TheDrummer/Rivermind-Lux-12B-v1
12B
•
Updated
May 6, 2025
•
22
•
22
t-tech/T-lite-it-2.1
Text Generation
•
8B
•
Updated
Dec 23, 2025
•
3.45k
•
•
20
Tesslate/UIGEN-X-8B
Text Generation
•
8B
•
Updated
Jul 18, 2025
•
38
•
•
67
Tesslate/WEBGEN-4B-Preview
Text Generation
•
4B
•
Updated
Sep 2, 2025
•
105
•
•
87
Nanbeige/Nanbeige4.1-3B
Text Generation
•
4B
•
Updated
Mar 25
•
4.61k
•
•
1.14k
TeichAI/Qwen3-8B-DeepSeek-v3.2-Speciale-Distill-GGUF
8B
•
Updated
Dec 10, 2025
•
3.96k
•
24
Load more