What datatype was used to train the model? Was it bfloat16 or simple float16?
· Sign up or log in to comment