deepseek-ai
/

DeepSeek-V2-Chat

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Resources

View closed (1)

Dddv

#16 opened 3 months ago by

NAN issue using FP16 to load the model

#15 opened 5 months ago by

ImportError: This modeling file requires the following packages that were not found in your environment: flash_attn. Run `pip install flash_attn`

#14 opened 8 months ago by

How much memory is needed if you make the 128k context length

#13 opened 9 months ago by

Implement MLA inference optimizations to DeepseekV2Attention

#12 opened 10 months ago by

Can you provide a sample code for training with DeepSpeed ZeRO3?

#10 opened 10 months ago by

Ollama support

#9 opened 10 months ago by

MoE offloading strategy？

#8 opened 10 months ago by

Update README.md

#7 opened 10 months ago by

VanishingPsychopath

kv cache

#6 opened 10 months ago by

function/tool calling support

#5 opened 10 months ago by

fail to run the example

#4 opened 10 months ago by

GPTQ plz

#3 opened 10 months ago by

Parkerlambert123

vllm support

#2 opened 10 months ago by

llama.cpp support

#1 opened 10 months ago by