Classify images in real-time using your webcam
Multilingual instruct model verifiably trained on open data
Interact with videos!
Retrieve videos of human motions based on text input
Describe math images and answer questions
Conversational speech generation