Collection of State-of-the-art FP8 Block Quantized Models
NM Testing
company
AI & ML interests
None defined yet.
Recent Activity
View all activity
models
492
nm-testing/TinyLlama-1.1B-Chat-v1.0-MXFP4
0.6B
•
Updated
•
36
nm-testing/granite-4.0-h-small-FP8-block
Text Generation
•
33B
•
Updated
•
230
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Static-Asym-e2e
1B
•
Updated
•
143
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Asym-e2e
1B
•
Updated
•
139
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-e2e
0.4B
•
Updated
•
149
nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16_channel-e2e
0.4B
•
Updated
•
154
nm-testing/TinyLlama-1.1B-Chat-v1.0-w4a16-sym-awq-e2e
0.3B
•
Updated
•
146
nm-testing/TinyLlama-1.1B-Chat-v1.0-w4a16-asym-awq-e2e
0.3B
•
Updated
•
153
nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-e2e
0.3B
•
Updated
•
166
nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16_channel-e2e
0.3B
•
Updated
•
176