Multi-image

by pbarker - opened Sep 26, 2024

Discussion

pbarker

Sep 26, 2024

Can this support multi-image?

chrisc36

Ai2 org Sep 26, 2024

•

edited Sep 26, 2024

The model was not trained on any multi-image data, and the preprocessor in this codebase does not currently support interleaved image/text messages.

The model's design does, in principle, allow it to handle multiple images as input by concatenating them into a very long input sequence, so it is still possible to try multi-image input (although it would require tweaking the preprocessor). However we have not experimented with this ourselves.

Florianeuler

Sep 26, 2024

Would be nice to have such a feature (especially for a multimodal RAG scenario...)

chrisc36 changed discussion status to closed Oct 4, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment