arXiv 2305.11627

LLM-Pruner: On the Structural Pruning of Large Language Models

By Xinyin Ma, Gongfan Fang, et al.

Published 2023-05-19

Large language models (LLMs) have shown remarkable capabilities in language understanding and generation. However, this capability typically comes with a substantial model size, which presents significant challenges across the deployment, inference, and training stages. Because an LLM is a general-purpose task solver, we explore its compression in a task-agnostic manner, which aims to preserve the multi-tas…
