W8A8 Quantization leads to wrong token

#31
by opter - opened

A wrong token, i.e., “游戏副本” frequently appears during infering process, but the other tokens are reasonable. Why does it happen?

Sign up or log in to comment