---
library_name: exllamav2
license: apache-2.0
license_link: https://huggingface.co/Qwen/Qwen3-235B-A22B/blob/main/LICENSE
pipeline_tag: text-generation
base_model: Qwen/Qwen3-235B-A22B
base_model_relation: quantized
tags:
  - exl2
---

Exllamav2 quantization of Qwen/Qwen3-235B-A22B

Quantized using commit 68976a0 of the exllamav2 dev branch. As of this writing, the main branch does not appear to support this architecture. To use this model, either build the dev branch from source or wait for a future release.

2.25 bpw h6 63.580 GiB
3.00 bpw h6 83.410 GiB
4.00 bpw h6 110.628 GiB
measurement.json
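
As a rough aid for choosing between the variants above, the following sketch converts each listed size into bits per nominal parameter and checks it against a memory budget. It assumes a parameter count of 235e9 (taken from the model name, not measured) and uses hypothetical helper names (`effective_bpw`, `fits`). The result will likely come out somewhat above the target bpw, presumably because of the 6-bit head (h6) and other tensors not stored at the target rate, though that explanation is unverified here. The check also ignores the KV cache and activations unless you pass an overhead.

```python
GIB = 1024 ** 3

# Assumption: nominal parameter count inferred from the model name (235B).
NOMINAL_PARAMS = 235e9

# Target bits per weight -> size in GiB, as listed above.
QUANT_SIZES_GIB = {2.25: 63.580, 3.00: 83.410, 4.00: 110.628}


def effective_bpw(size_gib, params=NOMINAL_PARAMS):
    """Bits stored per nominal parameter for a quant of the given size."""
    return size_gib * GIB * 8 / params


def fits(size_gib, vram_gib, overhead_gib=0.0):
    """True if the weights plus a caller-supplied overhead fit in vram_gib."""
    return size_gib + overhead_gib <= vram_gib


if __name__ == "__main__":
    total_vram_gib = 96.0  # hypothetical budget, e.g. 4 x 24 GiB GPUs
    for target, size in QUANT_SIZES_GIB.items():
        print(
            f"{target:.2f} bpw target: {size:.3f} GiB, "
            f"~{effective_bpw(size):.2f} bits per nominal parameter, "
            f"fits in {total_vram_gib:.0f} GiB: {fits(size, total_vram_gib)}"
        )
```

Pass a nonzero `overhead_gib` to leave room for the cache at your intended context length; the right value depends on your setup and is not something this card specifies.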