Yikang Shen PRO
YikangS
AI & ML interests
None yet
Organizations
YikangS's activity
When can we have the training code as illustrated in the paper.
12
#5 opened 11 months ago
by
Shamane

why not include Qwen1.5-MoE-A2.7B in the table?
1
#4 opened 11 months ago
by
J22
Dataset?
3
#1 opened 11 months ago
by
0xbitches
Adding `safetensors` variant of this model
#1 opened over 1 year ago
by
SFconvertbot

Adding `safetensors` variant of this model
#1 opened over 1 year ago
by
SFconvertbot
