Weiguo Liao
Weiguo
AI & ML interests
None yet
Organizations
None yet
Weiguo's activity
Add support for flash-attention2
2
#3 opened almost 2 years ago
by
shigureui

Has anyone written a stream_chat? The current experience is rather poor
2
11
#6 opened almost 2 years ago
by
Weiguo
Seems pretty weak: a 7B model won't even fit in a 3090's VRAM, and without the recommended acceleration packages it's painfully slow.
15
#12 opened almost 2 years ago
by
boxter007
Does BF16 depend on CUDA 11.7? My machine has 12.2
3
#7 opened almost 2 years ago
by
Weiguo