inference speed

by nilx21 - opened Dec 5, 2024

Dec 5, 2024

I've fine tuned this model on a custom dataset using LoRA then merged the weights by setting save_merged_lora_model=True.
When I tried to do an inference using the fine-tuned model, I've noticed the inference speed is really slow.
Would you have some ideas on why this happens?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment