arXiv 2305.11627

LLM-Pruner: On the Structural Pruning of Large Language Models

By Xinyin Ma, Gongfan Fang, et al.

Published 2023-05-19

Wiki summary

Explore the paper's summary, context, and related research on Papiers.

Large language models (LLMs) have shown remarkable capabilities in language understanding and generation. However, such impressive capability typically comes with a substantial model size, which presents significant challenges in both the deployment, inference, and training stages. With LLM being a general-purpose task solver, we explore its compression in a task-agnostic manner, which aims to preserve the multi-tas…

View the original paper on arXiv