arXiv 2505.11904

K*-Means: A Parameter-free Clustering Algorithm

By Louis Mahon and Mirella Lapata

Published 2025-05-17

Clustering is a widely used and powerful machine learning technique, but its effectiveness is often limited by the need to specify the number of clusters, k, or to rely on thresholds that implicitly determine k. We introduce k*-means, a novel clustering algorithm that eliminates the need to set k or any other parameters. Instead, it uses the minimum description length principle to automatically determine the opti…
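
To make the description-length idea concrete, here is a minimal sketch of choosing k by a two-part code length. It is not the k*-means procedure from the paper: it simply runs ordinary Lloyd's k-means for each candidate k and keeps the k with the shortest total code. The cost model is an assumption made for illustration (32 bits per centroid coordinate, log2(k) bits per cluster index, and residuals coded under one shared spherical Gaussian), and the helper names `farthest_first`, `description_length`, and `select_k` are hypothetical.

```python
import math
import random

FLOAT_BITS = 32  # assumed cost of storing one centroid coordinate


def farthest_first(points, k):
    """Deterministic init: start at points[0], then repeatedly add the
    point farthest from its nearest already-chosen centroid."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centroids))
        )
    return centroids


def assign(points, centroids):
    """Index of the nearest centroid for every point."""
    return [
        min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
        for p in points
    ]


def kmeans(points, k, iters=50):
    """Plain Lloyd's algorithm; returns (centroids, labels)."""
    centroids = farthest_first(points, k)
    for _ in range(iters):
        labels = assign(points, centroids)
        for j in range(k):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:  # keep the old centroid if a cluster empties
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return centroids, assign(points, centroids)


def description_length(points, centroids, labels):
    """Two-part code length in bits under the assumed cost model."""
    n, d, k = len(points), len(points[0]), len(centroids)
    model_bits = k * d * FLOAT_BITS               # the centroids themselves
    index_bits = n * math.log2(k)                 # which cluster each point is in
    sse = sum(math.dist(p, centroids[l]) ** 2 for p, l in zip(points, labels))
    var = max(sse / (n * d), 1e-12)               # shared per-coordinate variance
    # Gaussian code length for the residuals, up to a precision constant
    residual_bits = 0.5 * n * d * math.log2(2 * math.pi * math.e * var)
    return model_bits + index_bits + residual_bits


def select_k(points, k_max=6):
    """Brute-force sweep: return the k with the shortest description."""
    scores = {}
    for k in range(1, k_max + 1):
        centroids, labels = kmeans(points, k)
        scores[k] = description_length(points, centroids, labels)
    return min(scores, key=scores.get), scores


def make_blobs(centers, n_per, sigma, seed=0):
    """Synthetic 2-D Gaussian blobs for the demo."""
    rng = random.Random(seed)
    return [
        (cx + rng.gauss(0, sigma), cy + rng.gauss(0, sigma))
        for cx, cy in centers
        for _ in range(n_per)
    ]


if __name__ == "__main__":
    points = make_blobs([(0, 0), (10, 0), (0, 10)], n_per=60, sigma=0.5)
    best_k, scores = select_k(points)
    print("selected k:", best_k)
```

The trade-off is visible in the three terms: adding clusters shrinks the residual cost but raises the centroid and index costs, so on well-separated blobs the total should bottom out at the true number of clusters without k being supplied by hand.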