arXiv 2508.10956
ORBIT: An Object Property Reasoning Benchmark for Visual Inference Tasks
By Abhishek Kolari, Mohammadhossein Khojasteh, et al.
Published 2025-08-14
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
While vision-language models (VLMs) have made remarkable progress on many popular visual question answering (VQA) benchmarks, it remains unclear whether they abstract and reason over depicted objects. Inspired by human object categorisation, object property reasoning involves identifying and recognising low-level details and higher-level abstractions. While current VQA benchmarks consider a limited set of object pro…