arXiv 2409.03444
Fine-tuning large language models for domain adaptation: Exploration of training strategies, scaling, model merging and synergistic capabilities
By Wei Lu, Rachel K. Luu, et al.
Published 2024-09-05
The advancement of Large Language Models (LLMs) for domain applications in fields such as materials science and engineering depends on the development of fine-tuning strategies that adapt models for specialized, technical capabilities. In this work, we explore the effects of Continued Pretraining (CPT), Supervised Fine-Tuning (SFT), and various preference-based optimization approaches, including Direct Preference Op…