Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

moonshotai
/
Moonlight-16B-A3B-Instruct

Text Generation
Transformers
Safetensors
deepseek_v3
conversational
custom_code
text-generation-inference
Model card Files Files and versions Community
14
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

PEFT finetuning support

#14 opened about 1 month ago by
NePe

Model output is !!!!!!!!, I am using VLLM

#13 opened about 1 month ago by
David3698

Which custom code this model wants to run? Could you please explain me? I was using Llama.cpp.

👍 1
1
#12 opened about 1 month ago by
JLouisBiz

why the c-eval result is 76.8 for base model but only 38.9 for instruct model?

1
#8 opened 3 months ago by
xianf

eos_token_id is list not int

#7 opened 3 months ago by
ningpengtao

Awesome!

🔥 4
2
#6 opened 3 months ago by
SicariusSicariiStuff

Run this with chatllm.cpp

3
#5 opened 3 months ago by
J22
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs