K-Means Clustering

While not a particularly interesting computation, $k$-means is now a classic benchmark for Big Data ML. We specifically developed our PC $k$-means implementation to closely match the implementation in Spark’s MLlib. Both implementations use the standard trick, where, to find the centroid