arXiv 2411.15594

A Survey on LLM-as-a-Judge

By Jiawei Gu, Xuhui Jiang, et al.

Published 2024-11-23

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

Accurate and consistent evaluation is crucial for decision-making across numerous fields, yet it remains a challenging task due to inherent subjectivity, variability, and scale. Large Language Models (LLMs) have achieved remarkable success across diverse domains, leading to the emergence of "LLM-as-a-Judge," where LLMs are employed as evaluators for complex tasks. With their ability to process diverse data types and…

View the original paper on arXiv