arXiv 2305.11627

LLM-Pruner: On the Structural Pruning of Large Language Models

By Xinyin Ma, Gongfan Fang, et al.

Published 2023-05-19

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

Large language models (LLMs) have shown remarkable capabilities in language understanding and generation. However, such impressive capability typically comes with a substantial model size, which presents significant challenges in both the deployment, inference, and training stages. With LLM being a general-purpose task solver, we explore its compression in a task-agnostic manner, which aims to preserve the multi-tas…

View the original paper on arXiv