arXiv 2505.18350
How Many Parameters Does Your Task Really Need? Task Specific Pruning with LLM-Sieve
By Waleed Reda, Abhinav Jangda, et al.
Published 2025-05-23
Wiki summary
Explore the paper's summary, context, and related research on Papiers.
As Large Language Models (LLMs) are increasingly deployed for narrow tasks in resource-constrained settings, a central question arises: how much of an LLM is truly necessary for a given task? We present LLM-Sieve, a framework that prunes LLMs down to the minimal parameter subset needed to preserve task performance. Our approach introduces two innovations: (i) output-aligned non-orthogonal projections, which yield mo…