ISTA-DASLab/Llama-3.2-1B-Instruct-W4A4-mxfp4-rtn-identity-transform-sft-fp_quant
Updated
•
74
None defined yet.
Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization
The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm