arXiv 2403.15447

Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

By Junyuan Hong, Jinhao Duan, et al.

Published 2024-03-18

Discussion

Read the public discussion and references gathered around this paper.

Compressing high-capability Large Language Models (LLMs) has emerged as a favored strategy for resource-efficient inferences. While state-of-the-art (SoTA) compression methods boast impressive advancements in preserving benign task performance, the potential risks of compression in terms of safety and trustworthiness have been largely neglected. This study conducts the first, thorough evaluation of three (3) leading…

View the original paper on arXiv