arXiv 2403.15447
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression
By Junyuan Hong, Jinhao Duan, et al.
Published 2024-03-18
Compressing high-capability Large Language Models (LLMs) has emerged as a favored strategy for resource-efficient inference. While state-of-the-art (SoTA) compression methods boast impressive advancements in preserving benign task performance, the potential risks of compression in terms of safety and trustworthiness have been largely neglected. This study conducts the first thorough evaluation of three (3) leading…