arXiv 2410.14251
Synthesizing Post-Training Data for LLMs through Multi-Agent Simulation
By Shuo Tang, Xianghe Pang, et al.
Published 2024-10-18
Citation lineage
Review the prior work and downstream research connected to this paper.
Post-training is essential for enabling large language models (LLMs) to follow human instructions. However, its effectiveness depends on high-quality instruction data, which is challenging to obtain in the real world due to privacy concerns, data scarcity, and high annotation costs. To fill this gap, inspired by the recent success of using LLMs to simulate human society, we propose MATRIX, a multi-agent simulator th…