Exploring Model Kinship for Merging Large Language Models Paper • 2410.12613 • Published 25 days ago • 19
Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining Paper • 2410.08102 • Published about 1 month ago • 19
Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems Paper • 2408.16293 • Published Aug 29 • 24
From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty Paper • 2407.06071 • Published Jul 8 • 7
view article Article Expanding Model Context and Creating Chat Models with a Single Click By maywell • Apr 28 • 37