Batch inference

#4
by Pavelrst

Hey, quick question: is this model supposed to run faster on GPU when batch_size > 1?
I've tried running it with batch_size = 2, 4, and 8 and measured the time of .forward, but the forward pass was always slightly slower.
Any idea why?

I don't know, to be honest. It probably depends on your GPU VRAM.
