arXiv 2508.10956

ORBIT: An Object Property Reasoning Benchmark for Visual Inference Tasks

By Abhishek Kolari, Mohammadhossein Khojasteh, et al.

Published 2025-08-14

Wiki summary

Explore the paper's summary, context, and related research on Papiers.

While vision-language models (VLMs) have made remarkable progress on many popular visual question answering (VQA) benchmarks, it remains unclear whether they abstract and reason over depicted objects. Inspired by human object categorisation, object property reasoning involves identifying and recognising low-level details and higher-level abstractions. While current VQA benchmarks consider a limited set of object pro…

View the original paper on arXiv