arXiv 2408.00714
SAM 2: Segment Anything in Images and Videos
By Nikhila Ravi, Valentin Gabeur, et al.
Published 2024-08-01
We present Segment Anything Model 2 (SAM 2), a foundation model towards solving promptable visual segmentation in images and videos. We build a data engine, which improves model and data via user interaction, to collect the largest video segmentation dataset to date. Our model is a simple transformer architecture with streaming memory for real-time video processing. SAM 2 trained on our data provides strong performa…