 
				ISTA-DASLab/NVIDIA-Nemotron-Nano-9B-v2-W4A4-nvfp4-gptq-identity-transform-actorder
		
				7B
			• 
	
				Updated
					
				
				• 
					
					15
				
	
				
				
None defined yet.

Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization

The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm
 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				 
				