Code Plus Tech Talk: algorithm analysis

Saturday, May 11, 2013

Time & Space Complexity of Basic K-means Algorithm

The basic k-means clustering algorithm is a simple algorithm that separates the given data space into different clusters based on centroids calculation using some proximity function. Using this algorithm, we first choose the k- points as initial centroids and then each point is assigned to a cluster with the closest centroid. The algorithm is formally described as follows:

Input: A data set D containing m objects (points) with n attributes in an Euclidean space

Output: Partitioning of m objects into k-clusters C_1, C₂, C_3, …, C_k, i.e. C_i ⊂ D and C_i ∩ C_j = ᶲ (for 1 ≤ i, j ≤ k)