arXiv 2505.11904
K*-Means: A Parameter-free Clustering Algorithm
By Louis Mahon and Mirella Lapata
Published 2025-05-17
Citation lineage
Review the prior work and downstream research connected to this paper.
Clustering is a widely used and powerful machine learning technique, but its effectiveness is often limited by the need to specify the number of clusters, k, or by relying on thresholds that implicitly determine k. We introduce k*-means, a novel clustering algorithm that eliminates the need to set k or any other parameters. Instead, it uses the minimum description length principle to automatically determine the opti…