Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
mohitsha
/
Llama-2-70b-chat-hf-FP8-KV
like
0
Text Generation
Transformers
Safetensors
llama
conversational
text-generation-inference
Inference Endpoints
License:
llama2
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
README.md exists but content is empty.
Downloads last month
5
Safetensors
Model size
69B params
Tensor type
FP16
·
Inference Providers
NEW
Text Generation
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.
Collection including
mohitsha/Llama-2-70b-chat-hf-FP8-KV
FP8 KV Cache
Collection
Models with FP8 KV Cache Scales
•
6 items
•
Updated
Jul 4, 2024