Kai Chen

KaiChen1998

https://kaichen1998.github.io/

AI & ML interests

Omni-modal Large Language Models & Controllable Visual World Modeling & Autonomous Driving

Recent Activity

upvoted an article about 2 months ago

Why Did MiniMax M2 End Up as a Full Attention Model?

upvoted an article 6 months ago

🕳️ Attention Sinks in LLMs for endless fluency

upvoted a collection 7 months ago

commented a paper 8 months ago

Perceptual Decoupling for Scalable Multi-modal Reasoning via Reward-Optimized Captioning

Paper • 2506.04559 • Published Jun 5, 2025 • 2

New activity in Emova-ollm/Qwen2.5-7B-Instruct_add_speech_token_4096_nostrip 10 months ago

Improve language tag

#1 opened 10 months ago by

commented 5 papers 11 months ago

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

Paper • 2409.18042 • Published Sep 26, 2024 • 39

New activity in KaiChen1998/geodiffusion-coco-stuff-512x512 about 1 year ago

Adding `safetensors` variant of this model

#1 opened about 1 year ago by

New activity in KaiChen1998/geodiffusion-nuimages-512x512 about 1 year ago

Adding `safetensors` variant of this model

#1 opened about 1 year ago by

New activity in KaiChen1998/coda-lm over 1 year ago

Image annotation alignment

#1 opened over 1 year ago by

commented 2 papers over 1 year ago

Baichuan-Omni Technical Report

Paper • 2410.08565 • Published Oct 11, 2024 • 87

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

Paper • 2409.18042 • Published Sep 26, 2024 • 39