arXiv 2410.14251

Synthesizing Post-Training Data for LLMs through Multi-Agent Simulation

By Shuo Tang, Xianghe Pang, et al.

Published 2024-10-18

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

Post-training is essential for enabling large language models (LLMs) to follow human instructions. However, its effectiveness depends on high-quality instruction data, which is challenging to obtain in the real world due to privacy concerns, data scarcity, and high annotation costs. To fill this gap, inspired by the recent success of using LLMs to simulate human society, we propose MATRIX, a multi-agent simulator th…

View the original paper on arXiv