Doesn't give response

#3
by jawadmohmmad - opened

When I try to use the CPU in Colab, it doesn't give any results. Instead, it continues running indefinitely.

Our testing was done on Linux.

Maybe you can print the generated code before `run_one_code(final_output, anly_codemanager)` in infer.py to find which step is causing the problem.
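To make that suggestion concrete, here is a minimal sketch of a debug wrapper. `run_one_code` and `anly_codemanager` are the names from infer.py mentioned in this thread; `debug_run` itself is a hypothetical helper, not part of the repo:

```python
def debug_run(final_output, run_fn):
    """Hypothetical helper: print the generated code before executing it,
    so you can tell whether the hang happens during generation or during
    execution of the generated code."""
    print("=== generated code ===")
    print(final_output)
    print("======================")
    return run_fn(final_output)

# Usage sketch with the names from infer.py:
# debug_run(final_output, lambda code: run_one_code(code, anly_codemanager))
```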

It does not go beyond `_output = evaluate(instruction, model, tokenizer, generation_config)`. Here is my code:
https://colab.research.google.com/drive/1oTuetDysBM8NZFAlXEm8BOZnIlXiOLzi?usp=sharing

Inference using the CPU can take a lot of time.

Check whether it stops at `model.generate(...)`.

It gets stuck on:

```python
generation_output = model.generate(
    input_ids=input_ids,
    generation_config=generation_config,
    return_dict_in_generate=True,
    output_scores=False,
    max_new_tokens=max_new_tokens,
)
```
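One way to tell a genuine hang apart from CPU inference that is merely very slow is to wrap the call with a timeout. This is a generic sketch (not from the repo); note that a Python thread cannot be killed, so on timeout the underlying `generate` call keeps running in the background:

```python
import concurrent.futures

def run_with_timeout(fn, timeout_s, *args, **kwargs):
    """Run fn in a worker thread and wait up to timeout_s for its result.

    Raises concurrent.futures.TimeoutError if fn is still running after
    the deadline. The worker thread cannot be interrupted, so on timeout
    it continues in the background; this is only a diagnostic wrapper.
    """
    pool = concurrent.futures.ThreadPoolExecutor(max_workers=1)
    try:
        return pool.submit(fn, *args, **kwargs).result(timeout=timeout_s)
    finally:
        pool.shutdown(wait=False)

# Usage sketch with the names from this thread (model/input_ids as in the Colab):
# out = run_with_timeout(model.generate, 600, input_ids=input_ids,
#                        generation_config=generation_config,
#                        max_new_tokens=max_new_tokens)
```

If the call raises a timeout even with a generous limit and a small `max_new_tokens`, the problem is likely a hang rather than slow CPU decoding.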

@pipizhao , how can I initialize a quantized version of this model? I can't find the tokenizer for the quantized model (by TheBloke). Would you be kind enough to share sample code? TIA

@jawadmohmmad try using a GPU, my friend. :>

@pipizhao , I tried, but did not get an answer. Can you check my code here in Colab: https://colab.research.google.com/drive/12k7RVPmGpPfFNXY6fGvqtMoAVOtjz44o?usp=sharing

@ianuvrat Sorry, the GPTQ version is not officially provided by us. You can ask the creator of the model.
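For reference, a hedged sketch of how GPTQ conversions are commonly loaded. This is an assumption, not confirmed by this thread or the model authors: such repos typically reuse the base model's tokenizer, and recent `transformers` (with the `optimum` and `auto-gptq` packages installed) can load the quantized weights directly. The repo ids below are placeholders:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

def load_gptq(base_repo: str, gptq_repo: str):
    # Tokenizer from the original (non-quantized) repo; GPTQ repos often
    # omit it or simply copy it over from the base model.
    tokenizer = AutoTokenizer.from_pretrained(base_repo)
    # Recent transformers detects the GPTQ quantization config in the repo
    # automatically (requires the optimum and auto-gptq packages).
    model = AutoModelForCausalLM.from_pretrained(gptq_repo, device_map="auto")
    return tokenizer, model

# tokenizer, model = load_gptq("...", "...")  # placeholder repo ids
```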

pipizhao changed discussion status to closed
