arXiv 2403.15447
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression
By Junyuan Hong, Jinhao Duan, et al.
Published 2024-03-18
Discussion
Read the public discussion and references gathered around this paper.
Compressing high-capability Large Language Models (LLMs) has emerged as a favored strategy for resource-efficient inferences. While state-of-the-art (SoTA) compression methods boast impressive advancements in preserving benign task performance, the potential risks of compression in terms of safety and trustworthiness have been largely neglected. This study conducts the first, thorough evaluation of three (3) leading…