arXiv 2508.10956
ORBIT: An Object Property Reasoning Benchmark for Visual Inference Tasks
By Abhishek Kolari, Mohammadhossein Khojasteh, et al.
Published 2025-08-14
Wiki summary
Explore the paper's summary, context, and related research on Papiers.
While vision-language models (VLMs) have made remarkable progress on many popular visual question answering (VQA) benchmarks, it remains unclear whether they abstract and reason over depicted objects. Inspired by human object categorisation, object property reasoning involves identifying and recognising low-level details and higher-level abstractions. While current VQA benchmarks consider a limited set of object pro…