Peri-LN: Revisiting Layer Normalization in the Transformer Architecture Paper • 2502.02732 • Published Feb 4 • 1