arXiv 2505.11904

K*-Means: A Parameter-free Clustering Algorithm

By Louis Mahon and Mirella Lapata

Published 2025-05-17

Citation lineage

Review the prior work and downstream research connected to this paper.

Clustering is a widely used and powerful machine learning technique, but its effectiveness is often limited by the need to specify the number of clusters, k, or by relying on thresholds that implicitly determine k. We introduce k*-means, a novel clustering algorithm that eliminates the need to set k or any other parameters. Instead, it uses the minimum description length principle to automatically determine the opti…

View the original paper on arXiv