arXiv 2112.01527

Masked-attention Mask Transformer for Universal Image Segmentation

By Bowen Cheng, Ishan Misra, et al.

Published 2021-12-02

Image segmentation is about grouping pixels with different semantics, e.g., category or instance membership, where each choice of semantics defines a task. While only the semantics of each task differ, current research focuses on designing specialized architectures for each task. We present Masked-attention Mask Transformer (Mask2Former), a new architecture capable of addressing any image segmentation task (panoptic…
