ReluLLaMA-70B-PowerInfer-GGUF

This model is the downstream distribution of SparseLLM/ReluLLaMA-70B in PowerInfer GGUF format consisting of the LLM model weights and predictor weights.

Downloads last month
94
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no pipeline_tag.