arXiv 2505.16703

Locate-then-Merge: Neuron-Level Parameter Fusion for Mitigating Catastrophic Forgetting in Multimodal LLMs

By Zeping Yu and Sophia Ananiadou

Published 2025-05-22

Citation lineage

Review the prior work and downstream research connected to this paper.

Although multimodal large language models (MLLMs) have achieved impressive performance, the multimodal instruction tuning stage often causes catastrophic forgetting of the base LLM's language ability, even in strong models like Llama3. To address this, we propose Locate-then-Merge, a training-free parameter fusion framework that first locates important parameters and then selectively merges them. We further introduc…

View the original paper on arXiv