arXiv 2305.11627
LLM-Pruner: On the Structural Pruning of Large Language Models
By Xinyin Ma, Gongfan Fang, et al.
Published 2023-05-19
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
Large language models (LLMs) have shown remarkable capabilities in language understanding and generation. However, such impressive capability typically comes with a substantial model size, which presents significant challenges in both the deployment, inference, and training stages. With LLM being a general-purpose task solver, we explore its compression in a task-agnostic manner, which aims to preserve the multi-tas…