arXiv 2409.03444

Fine-tuning large language models for domain adaptation: Exploration of training strategies, scaling, model merging and synergistic capabilities

By Wei Lu, Rachel K. Luu, et al.

Published 2024-09-05

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

The advancement of Large Language Models (LLMs) for domain applications in fields such as materials science and engineering depends on the development of fine-tuning strategies that adapt models for specialized, technical capabilities. In this work, we explore the effects of Continued Pretraining (CPT), Supervised Fine-Tuning (SFT), and various preference-based optimization approaches, including Direct Preference Op…

View the original paper on arXiv