The Unreasonable Ineffectiveness of the Deeper Layers Paper • 2403.17887 • Published Mar 26 • 77 • 14
Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study Paper • 2403.03186 • Published Mar 5 • 4 • 1
RLVF: Learning from Verbal Feedback without Overgeneralization Paper • 2402.10893 • Published Feb 16 • 10 • 2
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning Paper • 2012.13255 • Published Dec 22, 2020 • 3 • 1
Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks Paper • 1602.07868 • Published Feb 25, 2016 • 2 • 1