Tags: Text Generation, Transformers, Safetensors, llama, conversational, text-generation-inference, compressed-tensors

This is shisa-ai/shisa-v2-llama3.1-405b-FP8-Dynamic, quantized with llm-compressor using shisa-ai/shisa-v2-sharegpt as the calibration set.

Original model: shisa-ai/shisa-v2-llama3.1-405b
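
The "Dynamic" in the model name usually means that activation scales are computed at inference time from each token's own values rather than fixed in advance. Below is a minimal pure-Python sketch of that idea. It is not llm-compressor's implementation: the helper names `round_to_e4m3`, `quantize_dynamic`, and `dequantize` are hypothetical, and the only format facts assumed are those of E4M3 (3 mantissa bits, minimum normal exponent -6, largest finite value 448).

```python
import math

E4M3_MAX = 448.0     # largest finite E4M3 value
MANTISSA_BITS = 3    # explicit mantissa bits in E4M3
MIN_EXP = -6         # exponent of the smallest normal E4M3 value


def round_to_e4m3(value: float) -> float:
    """Round a float to the nearest E4M3-representable value, saturating at +/-448."""
    if value == 0.0:
        return 0.0
    magnitude = min(abs(value), E4M3_MAX)
    # frexp gives magnitude = m * 2**e with 0.5 <= m < 1, so floor(log2) is e - 1.
    exponent = max(math.frexp(magnitude)[1] - 1, MIN_EXP)
    step = 2.0 ** (exponent - MANTISSA_BITS)      # spacing of the grid in this binade
    rounded = round(magnitude / step) * step      # Python's round() sends ties to even
    return math.copysign(min(rounded, E4M3_MAX), value)


def quantize_dynamic(token: list[float]) -> tuple[list[float], float]:
    """Quantize one token's activations using a scale derived from that token alone."""
    amax = max(abs(x) for x in token)
    scale = amax / E4M3_MAX if amax > 0 else 1.0  # map the largest value onto 448
    return [round_to_e4m3(x / scale) for x in token], scale


def dequantize(codes: list[float], scale: float) -> list[float]:
    """Recover approximate activations from the FP8 codes and the per-token scale."""
    return [c * scale for c in codes]


codes, scale = quantize_dynamic([0.5, -2.0, 1.0])
restored = dequantize(codes, scale)
```

Because the scale is derived from the token being quantized, no activation statistics have to be stored for this step. How the calibration set was used for this particular checkpoint is not stated here.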

Model size: 406B params
Tensor types: BF16, F8_E4M3
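
As a rough check on what the two tensor types imply for storage, the arithmetic below estimates weight size from the 406B parameter count. It is only a back-of-the-envelope sketch: the function name `weight_gigabytes` is hypothetical, it assumes 1 byte per F8_E4M3 parameter and 2 bytes per BF16 parameter, and it ignores scale tensors and file overhead. The share of parameters stored in each type is not given here, so it is an input you supply.

```python
BYTES_PER_PARAM = {"BF16": 2, "F8_E4M3": 1}


def weight_gigabytes(total_params: float, fp8_fraction: float) -> float:
    """Estimate weight storage in GB (1e9 bytes) for a mixed BF16/FP8 checkpoint."""
    if not 0.0 <= fp8_fraction <= 1.0:
        raise ValueError("fp8_fraction must be between 0 and 1")
    fp8_params = total_params * fp8_fraction
    bf16_params = total_params - fp8_params
    total_bytes = (
        fp8_params * BYTES_PER_PARAM["F8_E4M3"]
        + bf16_params * BYTES_PER_PARAM["BF16"]
    )
    return total_bytes / 1e9


TOTAL_PARAMS = 406e9  # parameter count reported for this checkpoint

all_bf16_gb = weight_gigabytes(TOTAL_PARAMS, fp8_fraction=0.0)  # upper bound
all_fp8_gb = weight_gigabytes(TOTAL_PARAMS, fp8_fraction=1.0)   # lower bound
```

Under these assumptions the weights fall between 406 GB (everything in FP8) and 812 GB (everything in BF16). The real checkpoint sits somewhere in between, depending on how many tensors were left in BF16.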
