arXiv 2505.00019

An Empirical Study on Prompt Compression for Large Language Models

By Zheng Zhang, Jinyi Li, et al.

Published 2025-04-24

Citation lineage

Review the prior work and downstream research connected to this paper.

Prompt engineering enables Large Language Models (LLMs) to perform a variety of tasks. However, lengthy prompts significantly increase computational complexity and economic costs. To address this issue, we study six prompt compression methods for LLMs, aiming to reduce prompt length while maintaining LLM response quality. In this paper, we present a comprehensive analysis covering aspects such as generation performa…

View the original paper on arXiv