arXiv 2305.11627
LLM-Pruner: On the Structural Pruning of Large Language Models
By Xinyin Ma, Gongfan Fang, et al.
Published 2023-05-19
Wiki summary
Large language models (LLMs) have shown remarkable capabilities in language understanding and generation. However, this capability typically comes with a substantial model size, which presents significant challenges across the deployment, inference, and training stages. Because an LLM is a general-purpose task solver, we explore its compression in a task-agnostic manner, which aims to preserve the multi-tas…