kaitchup
/

Qwen3-0.6B-NVFP4

compressed-tensors

Model card Files Files and versions

bnjmnmarie commited on Sep 8

Commit

5576beb

·

verified ·

1 Parent(s): 10fe913

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -7,7 +7,7 @@ tags:
 datasets:
 - HuggingFaceH4/ultrachat_200k
 ---
-This is [Qwen/Qwen3-32B](https://huggingface.co/Qwen/Qwen3-0.6B) quantized with [LLM Compressor](https://github.com/vllm-project/llm-compressor) in 4-bit (NVFP4), weights and activations.
 The calibration step used 512 samples of up to 2048 tokens, chat template applied, from [HuggingFaceH4/ultrachat_200k](https://huggingface.co/datasets/HuggingFaceH4/ultrachat_200k).
 The quantization has been done, tested, and evaluated by The Kaitchup.

 datasets:
 - HuggingFaceH4/ultrachat_200k
 ---
+This is [Qwen/Qwen3-0.6B](https://huggingface.co/Qwen/Qwen3-0.6B) quantized with [LLM Compressor](https://github.com/vllm-project/llm-compressor) in 4-bit (NVFP4), weights and activations.
 The calibration step used 512 samples of up to 2048 tokens, chat template applied, from [HuggingFaceH4/ultrachat_200k](https://huggingface.co/datasets/HuggingFaceH4/ultrachat_200k).
 The quantization has been done, tested, and evaluated by The Kaitchup.