arXiv 2403.15447

Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

By Junyuan Hong, Jinhao Duan, et al.

Published 2024-03-18

Compressing high-capability Large Language Models (LLMs) has emerged as a favored strategy for resource-efficient inference. While state-of-the-art (SoTA) compression methods boast impressive advancements in preserving benign task performance, the potential risks of compression in terms of safety and trustworthiness have been largely neglected. This study conducts the first, thorough evaluation of three (3) leading…
