amd-quark/tiny-llama-fast-tokenizer
amd-quark/llama-tiny-fp8-quark-quant-method
17.1M
•
Updated
•
2.02k
amd-quark/llama-tiny-fp8-quant-method
17.1M
•
Updated
•
1.99k
amd-quark/quark-assets
Updated
amd-quark/quark-legacy-int8
1.03M
•
Updated
amd-quark/quark-legacy-fp8
1.03M
•
Updated
amd-quark/quark-legacy-awq
16.7M
•
Updated
•
5
amd-quark/dummy-config-awq
Updated
•
1.23k
amd-quark/llama-small-int4-per-group-sym-awq
16.7M
•
Updated
•
523
amd-quark/llama-tiny-int4-per-group-sym
1.03M
•
Updated
•
523
amd-quark/llama-tiny-w-fp8-a-fp8-o-fp8
1.03M
•
Updated
•
511
amd-quark/llama-tiny-w-fp8-a-fp8
1.03M
•
Updated
•
518
amd-quark/llama-tiny-w-int8-b-int8-per-tensor
1.03M
•
Updated
•
523
amd-quark/llama-tiny-w-int8-per-tensor
1.03M
•
Updated
•
524
amd-quark/test-qdq
Updated