neuralmagic/Meta-Llama-3.1-8B-quantized.w8a8
Text Generation
Transformers
Safetensors
8 languages
llama
int8
vllm
quantized
8-bit precision
text-generation-inference
Inference Endpoints
compressed-tensors
arxiv: 2210.17323
License: llama3.1
Update README.md #1 by nm-research - opened Aug 21
base: refs/heads/main ← from: refs/pr/1
Discussion
Files changed: +1 -0
nm-research (Neural Magic org), Aug 21
No description provided.
Update README.md 8708ad84
alexmarques changed pull request status to closed Aug 21