Collection of State-of-the-art FP8 Block Quantized Models
NM Testing
company
AI & ML interests
None defined yet.
Recent Activity
models
486
nm-testing/llama2.c-stories42M-pruned2.4
Updated
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Static-Asym-e2e
1B
•
Updated
•
131
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Asym-e2e
1B
•
Updated
•
140
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-e2e
0.4B
•
Updated
•
149
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16_channel-e2e
0.4B
•
Updated
•
163
nm-testing/TinyLlama-1.1B-Chat-v1.0-w4a16-sym-awq-e2e
0.3B
•
Updated
•
131
nm-testing/TinyLlama-1.1B-Chat-v1.0-w4a16-asym-awq-e2e
0.3B
•
Updated
•
148
nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-e2e
0.3B
•
Updated
•
171
nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16_channel-e2e
0.3B
•
Updated
•
166
nm-testing/TinyLlama-1.1B-Chat-v1.0-actorder-weight-e2e
0.3B
•
Updated
•
155