The Pile: An 800GB Dataset of Diverse Text for Language Modeling Paper • 2101.00027 • Published Dec 31, 2020 • 7
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 62
Article Making LLMs Smaller Without Breaking Them: A GLU-Aware Pruning Approach By oopere • Nov 24, 2024 • 7