HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-Distill-6-experts-test-1 3B • Updated Jun 16 • 4
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-Distill-6-experts-test-token-specific 3B • Updated Jun 17 • 4
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-Distill-6-experts-test-token-specific-5-epoch 3B • Updated Jun 23 • 3
mlx-community/DeepSeek-Coder-V2-Lite-Instruct-4bit-AWQ Text Generation • 16B • Updated Jun 27 • 1.05k
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-Distill-6-experts-token-specific-3-scaled 3B • Updated Jul 1 • 4
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-Distill-6-experts-token-specific 3B • Updated Jul 1 • 3
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-diff-info-Distill-token-specific 16B • Updated Jul 10 • 3
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-diff-info-Distill-token-specific-scale 16B • Updated Jul 10 • 3
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-diff-info-Distill-mixture 16B • Updated Jul 10 • 3
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-diff-info-Distill-forward-kl 16B • Updated Jul 10 • 4
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-Distill-6-experts-test-may 3B • Updated Jul 14 • 2