Ruslan

uzvisa

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

baidu/Unlimited-OCR

new activity about 2 months ago

Qwen/Qwen3.6-35B-A3B:how to enable non-thinking mode of this model in llama.cpp?

reacted to eaddario's post with 👍 about 2 months ago

Experimental global target bits-per-weight quantization of Qwen/Qwen3.6-27B and Qwen/Qwen3.6-35B-A3B. Unlike standard llama.cpp quantizations that rely on fixed type heuristics (e.g., Q4_K_M), the Target BPW approach optimizes per-tensor precision where it matters most and produces high-quality models that meet a precise global file size target.

Key Advantages:
- VRAM Maximization: Can generate high-quality models sized exactly to fit hardware constraints (e.g., fitting the model into exactly 24GB VRAM).
- Data-Driven Precision: Quantization mix is determined by actual weight error sensitivity rather than hardcoded rules, often yielding better PPL/KLD size trade-offs.

Full benchmarks (PPL, KLD, ARC, GPQA, MMLU, etc.) and methodology are in the models' cards.

https://huggingface.co/eaddario/Qwen3.6-27B-GGUF
https://huggingface.co/eaddario/Qwen3.6-35B-A3B-GGUF
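The idea of meeting a global size target while spending precision where it matters most can be illustrated with a toy greedy allocator. This is only a sketch under stated assumptions, not the method used in the post or anything in llama.cpp: the `Tensor` class, the `error_at` tables, and both function names are hypothetical, and the error numbers are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Tensor:
    name: str
    n_weights: int
    # Hypothetical error estimate for each candidate bit width (lower is better).
    error_at: dict[int, float]


def target_bpw(budget_bytes: int, total_weights: int) -> float:
    """Average bits per weight that fits a byte budget (ignores metadata and KV cache)."""
    return budget_bytes * 8 / total_weights


def allocate_bits(tensors: list[Tensor], budget_bits: int) -> dict[str, int]:
    """Greedy sketch: start every tensor at its lowest width, then repeatedly
    apply the upgrade with the best error reduction per extra bit that still fits."""
    choice = {t.name: min(t.error_at) for t in tensors}
    used = sum(choice[t.name] * t.n_weights for t in tensors)
    while True:
        best = None
        for t in tensors:
            cur = choice[t.name]
            higher = [b for b in t.error_at if b > cur]
            if not higher:
                continue  # already at the widest candidate
            nxt = min(higher)
            cost = (nxt - cur) * t.n_weights
            if used + cost > budget_bits:
                continue  # upgrade would exceed the global budget
            gain = (t.error_at[cur] - t.error_at[nxt]) / cost
            if gain > 0 and (best is None or gain > best[0]):
                best = (gain, t.name, nxt, cost)
        if best is None:
            return choice
        _, name, nxt, cost = best
        choice[name] = nxt
        used += cost


# Illustrative data only: a small, sensitive tensor and a large, robust one.
tensors = [
    Tensor("attn", 100, {4: 1.0, 6: 0.2, 8: 0.1}),
    Tensor("ffn", 300, {4: 0.5, 6: 0.4, 8: 0.35}),
]
plan = allocate_bits(tensors, budget_bits=5 * 400)  # 5 bits per weight on average
```

With this made-up data the sensitive tensor absorbs the whole spare budget while the large tensor stays at its lowest width, which is the kind of trade-off a data-driven mix is meant to find. The helper `target_bpw` shows the arithmetic behind a size target: a 24 GB budget spread over 27 billion weights comes to roughly 7.1 bits per weight before any overhead.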

Organizations

None yet

liked a model 3 days ago

baidu/Unlimited-OCR

Image-Text-to-Text • 3B • Updated 3 minutes ago • 213k • 1.15k

liked a model 3 months ago

Tesslate/OmniCoder-9B

Text Generation • 9B • Updated Mar 13 • 3.88k • 650

liked 18 models 4 months ago

steampunque/Qwen3-VL-8B-Instruct-MP-GGUF

8B • Updated Feb 18 • 142 • 2

steampunque/gemma-3-12b-it-MP-GGUF

12B • Updated Feb 18 • 30 • 1

steampunque/Ministral-3-8B-Instruct-2512-MP-GGUF

8B • Updated Feb 18 • 11 • 1

steampunque/Qwen2.5-Coder-14B-Instruct-MP-GGUF

15B • Updated Feb 18 • 16 • 1

tencent/HY-MT1.5-7B-GGUF

Translation • 8B • Updated Jan 7 • 988 • 56

allura-forge/Llama-3.3-8B-Instruct

8B • Updated Dec 31, 2025 • 694 • 206

mradermacher/Nanbeige-4.1-Python-DeepThink-3B-GGUF

4B • Updated Feb 18 • 132 • 3

deltakitsune/Nanbeige-4.1-Python-DeepThink-3B

Text Generation • 4B • Updated Feb 16 • 559 • 7

TheDrummer/Tiger-Gemma-12B-v3-GGUF

13B • Updated Jul 9, 2025 • 1.44k • 15

MuXodious/Nanbeige4.1-3B-PaperWitch-heresy

Text Generation • 4B • Updated Feb 19 • 7 • 4

gabriellarson/WEBGEN-4B-Preview-GGUF

Text Generation • 4B • Updated Sep 2, 2025 • 308 • 20

TheDrummer/Rocinante-X-12B-v1

12B • Updated Jan 25 • 1.41k • 85

TheDrummer/Rivermind-Lux-12B-v1

12B • Updated May 6, 2025 • 22 • 22

t-tech/T-lite-it-2.1

Text Generation • 8B • Updated Dec 23, 2025 • 3.45k • 20

Tesslate/UIGEN-X-8B

Text Generation • 8B • Updated Jul 18, 2025 • 38 • 67

Tesslate/WEBGEN-4B-Preview

Text Generation • 4B • Updated Sep 2, 2025 • 105 • 87

Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated Mar 25 • 4.61k • 1.14k

TeichAI/Qwen3-8B-DeepSeek-v3.2-Speciale-Distill-GGUF

8B • Updated Dec 10, 2025 • 3.96k • 24