Can you provide Machine Specs

by kingabzpro - opened about 18 hours ago

Discussion

kingabzpro

about 18 hours ago

How many H100s are required to run this model locally and other parameters for hardware optimization.

aaron-newsome

about 12 hours ago

From the deployment guide:

The smallest deployment unit for Kimi-K2 FP8 weights with 128k seqlen on mainstream H200 or H20 platform is a cluster with 16 GPUs with either Tensor Parallel (TP) or "data parallel + expert parallel" (DP+EP).

https://github.com/MoonshotAI/Kimi-K2/blob/main/docs/deploy_guidance.md

lsw825

Moonshot AI org about 4 hours ago

The number of H100s needed at least is 16 with very short sequence length (only for simple testing). For a normal experience, 32 H100s are required.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment