kotoba-tech/kotoba-whisper-v2.0 • Automatic Speech Recognition • 0.8B • Updated Oct 23, 2024 • 5.53k • 65
Llama-3-EvoVLM-JP-v2 🐠 Chat with images and text using a Japanese visual language model • Running on Zero • 18