mohitsha/Llama-2-70b-chat-hf-FP8-KV-AMMO
Tags: Text Generation, Transformers, Safetensors, llama, conversational, text-generation-inference
License: llama2
README.md exists but content is empty.
Downloads last month: 5
Safetensors
Model size: 69B params
Tensor type: FP16
Collection including mohitsha/Llama-2-70b-chat-hf-FP8-KV-AMMO
FP8 KV Cache: Models with FP8 KV Cache Scales (6 items, updated Jul 4, 2024)