arXiv 2305.11627
LLM-Pruner: On the Structural Pruning of Large Language Models
By Xinyin Ma, Gongfan Fang, et al.
Published 2023-05-19
Large language models (LLMs) have shown remarkable capabilities in language understanding and generation. However, such capability typically comes with a substantial model size, which presents significant challenges in the deployment, inference, and training stages. With LLMs being general-purpose task solvers, we explore their compression in a task-agnostic manner, which aims to preserve the multi-tas…