Evaluate open-ended outputs from AI models using MM-Vet
Generate text by combining an image and a question
Generate responses using images and text
Meta Llama 3 8B with LLaVA multimodal capabilities
Chat with Qwen and get text responses
Upload and evaluate video models