dbrx-instruct fp8 quantization

#1
by kewang2 - opened
No description provided.

initial fp8 quantized version of dbrx-moe

kewang2 changed pull request status to open
kewang2 changed pull request status to merged