Does it work to quantize an existing LLM into 1.58-bit?

#1
by SilverJim - opened

Hello, I wonder: does it work to quantize an existing LLM into 1.58-bit?
Regarding 1.58-bit quantization, I think quantizing an existing LLM into 1.58-bit loses too much precision; it seems 1.58-bit quantization can only get a good result when the LLM is trained as a 1.58-bit model from the beginning.
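To make the precision concern concrete, here is a minimal sketch of BitNet b1.58-style absmean ternary quantization applied post-hoc to pretrained-style weights, assuming NumPy; the function name and the synthetic Gaussian weights are illustrative, not from any particular model. Rounding every weight to {-1, 0, +1} with a single per-tensor scale leaves a large reconstruction error, which is the precision loss described above:

```python
import numpy as np

def absmean_ternary_quantize(w, eps=1e-8):
    # BitNet b1.58-style absmean quantization: scale by the mean
    # absolute value, then round each weight to {-1, 0, +1}.
    scale = np.mean(np.abs(w)) + eps
    q = np.clip(np.round(w / scale), -1, 1)
    return q, scale

rng = np.random.default_rng(0)
w = rng.normal(0, 0.02, size=4096)   # stand-in for pretrained FP weights
q, scale = absmean_ternary_quantize(w)
w_hat = q * scale                    # dequantized approximation
rel_err = np.linalg.norm(w - w_hat) / np.linalg.norm(w)
print(f"relative reconstruction error: {rel_err:.2f}")
```

The per-layer error itself is moderate, but it compounds across dozens of transformer layers, which is why quantization-aware training from the start (letting the model adapt to the ternary constraint) tends to fare better than converting an existing checkpoint.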

I'm currently battling the quality-loss and compounding-error issues. No luck yet, but once I sort it out I'll update the weights in place and update the readme and model card.