scaling / c4_original-d=512_l=8_h=4-2.0
sagadre
overtraining model release
6ade3a7