EMOVA Hugging Face

Enterprise

community

https://emova-ollm.github.io/

emova-ollm

AI & ML interests

Omni-modal Large Language Models, Multi-modal Large Language Models (MLLMs), Emotional spoken dialogue

Recent Activity

KaiChen1998 updated a Space 3 days ago

Emova-ollm/RACRO-demo

KaiChen1998 published a Space 3 days ago

Emova-ollm/RACRO-demo

KaiChen1998 authored a paper 12 days ago

Perceptual Decoupling for Scalable Multi-modal Reasoning via Reward-Optimized Captioning

View all activity

Organization Card

Community About org cards

👋 Welcome to EMOVA! We are a team focusing on fully open-sourced omni-modal foundational models with visual, textual, and speech capabilities. EMOVA (EMotionally Omni-present Voice Assistant) is a novel Omni-modal Large Language Model with end-to-end speech capabilities while maintaining state-of-the-art vision-language performance. We wish to promote the development of omni-modal human interactions with intelligent models!

Collections 2

spaces 2

Running on Zero

RACRO Online Interactive Demo

Live Interactive demo for RACRO-7B-CRO-GRPO backbone

Running on Zero

EMOVA Online Interactive Demo

Live Interactive demo for EMOVA with Qwen-2.5 backbone

models 13

Emova-ollm/Qwen2.5-7B-Instruct_add_speech_token_4096_nostrip

Feature Extraction • Updated Apr 28 • 20

Emova-ollm/emova-qwen-2-5-72b-hf

Feature Extraction • Updated Mar 14 • 13 • 2

Emova-ollm/emova-qwen-2-5-72b

Text Generation • Updated Mar 13 • 24 • 1

Emova-ollm/emova-qwen-2-5-7b-hf

Feature Extraction • Updated Mar 13 • 31 • 2

Emova-ollm/emova-qwen-2-5-7b

Text Generation • Updated Mar 13 • 24 • 1

Emova-ollm/emova-qwen-2-5-3b-hf

Feature Extraction • Updated Mar 13 • 19 • 5

Emova-ollm/emova-qwen-2-5-3b

Text Generation • Updated Mar 13 • 33 • 2

Emova-ollm/qwen2vit600m

Feature Extraction • Updated Mar 12 • 7.99k

Emova-ollm/Meta-Llama-3.1-8B-Instruct_add_speech_token_4096_nostrip-2

Feature Extraction • Updated Mar 12 • 11

Emova-ollm/Qwen2.5-3B-Instruct_add_speech_token_4096_nostrip

Text Generation • Updated Mar 12 • 21

datasets 5

Emova-ollm/emova-alignment-7m

Viewer • Updated Mar 14 • 6.18M • 6.73k • 1

Emova-ollm/emova-sft-speech-eval

Viewer • Updated Mar 14 • 3.76k • 46

Emova-ollm/emova-asr-tts-eval

Viewer • Updated Mar 14 • 5.24k • 22

Emova-ollm/emova-sft-speech-231k

Viewer • Updated Mar 14 • 231k • 287 • 2

Emova-ollm/emova-sft-4m

Viewer • Updated Mar 14 • 4.31M • 5.16k • 1