Spaces: Running on A100
How is the model so fast and accurate? #17
by Saugatkafley - opened
I am really impressed by how fast it can generate excellent answers almost instantly. What is used behind this low-latency inference?
This demo uses TGI (https://github.com/huggingface/text-generation-inference) as the backend.
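For anyone curious how to talk to a TGI server themselves, here is a minimal sketch that POSTs a prompt to TGI's `/generate` route. The endpoint URL is a placeholder assumption (a local deployment); the request/response shape follows TGI's documented API, where the prompt goes under `"inputs"` and sampling options under `"parameters"`.

```python
import json
from urllib.request import Request, urlopen

# Assumed local TGI deployment; replace with your own server URL.
TGI_URL = "http://localhost:8080/generate"

def build_payload(prompt: str, max_new_tokens: int = 64) -> dict:
    # TGI's /generate route expects the prompt under "inputs"
    # and generation options under "parameters".
    return {
        "inputs": prompt,
        "parameters": {"max_new_tokens": max_new_tokens},
    }

def generate(prompt: str, url: str = TGI_URL) -> str:
    # Send the request to a running TGI server and return the generated text.
    data = json.dumps(build_payload(prompt)).encode("utf-8")
    req = Request(url, data=data, headers={"Content-Type": "application/json"})
    with urlopen(req) as resp:
        return json.loads(resp.read())["generated_text"]
```

With a TGI server running, `generate("What is deep learning?")` returns the model's completion as a string.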
Thank you so much, @olivierdehaene! It is really fast!