arXiv 2409.03444

Fine-tuning large language models for domain adaptation: Exploration of training strategies, scaling, model merging and synergistic capabilities

By Wei Lu, Rachel K. Luu, et al.

Published 2024-09-05

Wiki summary

The advancement of Large Language Models (LLMs) for domain applications in fields such as materials science and engineering depends on the development of fine-tuning strategies that adapt models for specialized, technical capabilities. In this work, we explore the effects of Continued Pretraining (CPT), Supervised Fine-Tuning (SFT), and various preference-based optimization approaches, including Direct Preference Op…
