arXiv 2510.13494
LiteraryQA: Towards Effective Evaluation of Long-document Narrative QA
By Tommaso Bonomo, Luca Gioffré, et al.
Published 2025-10-15
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
Question Answering (QA) on narrative text poses a unique challenge to current systems, requiring a deep understanding of long, complex documents. However, the reliability of NarrativeQA, the most widely used benchmark in this domain, is hindered by noisy documents and flawed QA pairs. In this work, we introduce LiteraryQA, a high-quality subset of NarrativeQA focused on literary works. Using a human- and LLM-validat…