arXiv 2112.03570

Membership Inference Attacks From First Principles

By Nicholas Carlini, Steve Chien, et al.

Published 2021-12-07

A membership inference attack allows an adversary to query a trained machine learning model to predict whether or not a particular example was contained in the model's training dataset. These attacks are currently evaluated using average-case "accuracy" metrics that fail to characterize whether the attack can confidently identify any members of the training set. We argue that attacks should instead be evaluated by c…
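As a rough illustration of why an average-case metric can hide what matters here, the sketch below compares two hypothetical attacks on made-up scores. The score values, the function names `balanced_accuracy` and `confident_hits`, and the 0.5 threshold are all assumptions for this example and are not taken from the paper. Both attacks reach the same balanced accuracy, yet only one of them ranks any member above every non-member.

```python
# Toy scores, invented for illustration: higher means "more likely a member".

def balanced_accuracy(member_scores, nonmember_scores, threshold):
    """Mean of the true-positive and true-negative rates at one threshold."""
    tpr = sum(s >= threshold for s in member_scores) / len(member_scores)
    tnr = sum(s < threshold for s in nonmember_scores) / len(nonmember_scores)
    return (tpr + tnr) / 2


def confident_hits(member_scores, nonmember_scores):
    """Count members that score strictly above every non-member."""
    ceiling = max(nonmember_scores)
    return sum(s > ceiling for s in member_scores)


# Hypothetical attack A: mild separation, no member stands out.
a_members = [0.6, 0.6, 0.6, 0.4, 0.4]
a_nonmembers = [0.4, 0.4, 0.4, 0.6, 0.6]

# Hypothetical attack B: same counts on each side of 0.5, but two members
# receive scores far above anything a non-member gets.
b_members = [0.99, 0.98, 0.6, 0.4, 0.4]
b_nonmembers = [0.4, 0.4, 0.4, 0.6, 0.6]

acc_a = balanced_accuracy(a_members, a_nonmembers, threshold=0.5)
acc_b = balanced_accuracy(b_members, b_nonmembers, threshold=0.5)
hits_a = confident_hits(a_members, a_nonmembers)
hits_b = confident_hits(b_members, b_nonmembers)

print(f"attack A: balanced accuracy={acc_a:.2f}, confident hits={hits_a}")
print(f"attack B: balanced accuracy={acc_b:.2f}, confident hits={hits_b}")
```

In this toy setup the accuracy figure cannot tell the two attacks apart, while the count of confidently identified members can. That count is only a stand-in chosen for this sketch, not the evaluation the authors propose.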
