ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128-BitBLAS Text Generation • 5B • Updated Jul 22, 2024 • 10
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64-BitBLAS Text Generation • 3B • Updated Jul 22, 2024 • 13
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128-BitBLAS Text Generation • 3B • Updated Jul 22, 2024 • 21
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w4g128-BitBLAS Text Generation • 37B • Updated Jul 22, 2024 • 7
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g64-BitBLAS Text Generation • 21B • Updated Jul 22, 2024 • 7
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g128-BitBLAS Text Generation • 20B • Updated Jul 22, 2024 • 8