arXiv 2510.13494

LiteraryQA: Towards Effective Evaluation of Long-document Narrative QA

By Tommaso Bonomo, Luca Gioffré, et al.

Published 2025-10-15

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

Question Answering (QA) on narrative text poses a unique challenge to current systems, requiring a deep understanding of long, complex documents. However, the reliability of NarrativeQA, the most widely used benchmark in this domain, is hindered by noisy documents and flawed QA pairs. In this work, we introduce LiteraryQA, a high-quality subset of NarrativeQA focused on literary works. Using a human- and LLM-validat…

View the original paper on arXiv