arXiv 2506.23352

GeoProg3D: Compositional Visual Reasoning for City-Scale 3D Language Fields

By Shunsuke Yasuki, Taiki Miyanishi, et al.

Published 2025-06-29

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

The advancement of 3D language fields has enabled intuitive interactions with 3D scenes via natural language. However, existing approaches are typically limited to small-scale environments, lacking the scalability and compositional reasoning capabilities necessary for large, complex urban settings. To overcome these limitations, we propose GeoProg3D, a visual programming framework that enables natural language-drive…

View the original paper on arXiv