ISTA-DASLab/Qwen3-30B-A3B-Instruct-2507-W4A4-mxfp4-rtn-identity-transform
17B • Updated • 12
Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization
The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm