arXiv 2503.21770

Visual Jenga: Discovering Object Dependencies via Counterfactual Inpainting

By Anand Bhattad, Konpat Preechakul, et al.

Published 2025-03-27

This paper proposes a novel scene understanding task called Visual Jenga. Drawing inspiration from the game Jenga, the proposed task involves progressively removing objects from a single image until only the background remains. Just as Jenga players must understand structural dependencies to maintain tower stability, our task reveals the intrinsic relationships between scene elements by systematically exploring whic…
