A Survey on Model Compression for Large Language Models • Paper • 2308.07633 • Published Aug 15, 2023
Model Compression and Efficient Inference for Large Language Models: A Survey • Paper • 2402.09748 • Published Feb 15, 2024