Macropodus
/

MWP-Instruct

Model card Files Files and versions Metrics Training metrics Community

Macropodus commited on Aug 24, 2023

Commit

4eb108b

·

1 Parent(s): 8fbc569

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -1,6 +1,9 @@
 # chatglm-maths
 chatglm-6b微调/LORA/PPO/推理, 样本为自动生成的整数/小数加减乘除运算, 可gpu/cpu
 ## 踩坑
 ```python
 1. eps=1e-5(不要改小), 半精度float16, 以及LN采用的是Post-LN(泛化性更好) + DeepNorm, 【害, Attention前也有LN】目的是大模型为了防止梯度溢出等;

 # chatglm-maths
 chatglm-6b微调/LORA/PPO/推理, 样本为自动生成的整数/小数加减乘除运算, 可gpu/cpu
+# Github
+ [https://github.com/yongzhuo/chatglm-maths](https://github.com/yongzhuo/chatglm-maths)
 ## 踩坑
 ```python
 1. eps=1e-5(不要改小), 半精度float16, 以及LN采用的是Post-LN(泛化性更好) + DeepNorm, 【害, Attention前也有LN】目的是大模型为了防止梯度溢出等;