arXiv 2505.18350

How Many Parameters Does Your Task Really Need? Task Specific Pruning with LLM-Sieve

By Waleed Reda, Abhinav Jangda, et al.

Published 2025-05-23

Wiki summary

Explore the paper's summary, context, and related research on Papiers.

As Large Language Models (LLMs) are increasingly deployed for narrow tasks in resource-constrained settings, a central question arises: how much of an LLM is truly necessary for a given task? We present LLM-Sieve, a framework that prunes LLMs down to the minimal parameter subset needed to preserve task performance. Our approach introduces two innovations: (i) output-aligned non-orthogonal projections, which yield mo…

View the original paper on arXiv