ISTA-DASLab/Llama-3.2-3B-AQLM-PV-2Bit-2x8
Text Generation
•
1B
•
Updated
None defined yet.
CAGE: Curvature-Aware Gradient Estimation For Accurate Quantization-Aware Training
Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization