quantization? fine-tune?

#68
by tcporco - opened

Did anyone make a quantized version of this, or fine tune it? Has anyone ever run it with anything other than 8 A100s?

exactly what i was wondering. If not, i dont understand how this differs from just using the chat bot. Its just an extension of the chat bot as an API to charge

Sign up or log in to comment