arXiv 2411.15594
A Survey on LLM-as-a-Judge
By Jiawei Gu, Xuhui Jiang, et al.
Published 2024-11-23
Wiki summary
Explore the paper's summary, context, and related research on Papiers.
Accurate and consistent evaluation is crucial for decision-making across numerous fields, yet it remains a challenging task due to inherent subjectivity, variability, and scale. Large Language Models (LLMs) have achieved remarkable success across diverse domains, leading to the emergence of "LLM-as-a-Judge," where LLMs are employed as evaluators for complex tasks. With their ability to process diverse data types and…