moonshotai/Kimi-K2-Instruct

11 days ago

•

Hello!

Fantastic model, great job on it. Great logic, nice writing style, fantastic creative writing, top coder. It's the best local model for sure right now.

However, just like the DeepSeek model it lacks multimodality or even omnimodality. For many, these are important use cases, video/image/audio/text in, audio/text out would be enough.

Without sacrifcing text generation performance, it would be a killer model. (Even though I can't run it, I really want downscaled versions of this model for consumer systems)

Have a good day, your work is appreciated.

xxr3376

Moonshot AI org 11 days ago

There will be a multimodal model of course!

AlexCoder9

3 days ago

Great model - great work by your team - eagerly awaiting the multimodal version.

moonshotai
/

Kimi-K2-Instruct

Multimodality