---
pipeline_tag: any-to-any
datasets:
- openbmb/RLAIF-V-Dataset
library_name: transformers
language:
- multilingual
tags:
- minicpm-o
- omni
- vision
- ocr
- multi-image
- video
- custom_code
- audio
- speech
- voice cloning
- live Streaming
- realtime speech conversation
- asr
- tts
---

<h1>A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone</h1>

## MiniCPM-o 2.6 int4

This is the int4 quantized version of [**MiniCPM-o 2.6**](https://modelscope.cn/models/OpenBMB/MiniCPM-o-2_6).

Running the int4 version uses less GPU memory (about 9 GB).

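As a rough illustration of why quantization lowers memory use, the sketch below estimates the size of the weights alone at different precisions. The parameter count of about 8 billion is an assumption made for this example and is not stated in this card; `weight_memory_gib` is a hypothetical helper, not part of any library.

```python
def weight_memory_gib(num_params: float, bits_per_weight: int) -> float:
    """Return the memory needed to store the weights alone, in GiB."""
    bytes_total = num_params * bits_per_weight / 8
    return bytes_total / (1024 ** 3)


# Assumption: roughly 8 billion parameters (not stated in this card).
ASSUMED_PARAMS = 8e9

bf16_gib = weight_memory_gib(ASSUMED_PARAMS, 16)  # 16 bits per weight
int4_gib = weight_memory_gib(ASSUMED_PARAMS, 4)   # 4 bits per weight

print(f"bf16 weights: {bf16_gib:.1f} GiB")
print(f"int4 weights: {int4_gib:.1f} GiB")
```

Under this assumption the int4 weights take a quarter of the bf16 footprint. The figure of about 9 GB quoted above is larger than the int4 weight estimate, presumably because some components are kept at higher precision and because activations and caches also occupy GPU memory at run time.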