llama.cpp importance matrices for various language models. The matrices were computed on Wikitext-103 training data unless noted otherwise, and the suffix indicates the number of input tokens used.