jln.shk_64x4: JSAE (resid_mid -> resid_out) 7.5k steps
j.mlp_layer.shk_64x4: JSAE (mlp_in -> mlp_out) 7.5k steps
David Quarel
davidquarel
·
AI & ML interests
None yet
Recent Activity
updated
a model
about 23 hours ago
davidquarel/jaxgmg_ckpt_pt
updated
a model
22 days ago
davidquarel/jaxgmg_open_alpha0_gamma_sweep
published
a model
22 days ago
davidquarel/jaxgmg_open_alpha0_gamma_sweep