arXiv 2505.18350
How Many Parameters Does Your Task Really Need? Task Specific Pruning with LLM-Sieve
By Waleed Reda, Abhinav Jangda, et al.
Published 2025-05-23
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
As Large Language Models (LLMs) are increasingly deployed for narrow tasks in resource-constrained settings, a central question arises: how much of an LLM is truly necessary for a given task? We present LLM-Sieve, a framework that prunes LLMs down to the minimal parameter subset needed to preserve task performance. Our approach introduces two innovations: (i) output-aligned non-orthogonal projections, which yield mo…