This is an R implementation of the clustering example provided with Mahout. The original problem description is:
A time series of control charts needs to be clustered into their close knit groups. The data set we use is synthetic and so resembles real world information in an anonymized format. It contains six different classes (Normal, Cyclic, Increasing trend, Decreasing trend, Upward shift, Downward shift). With these trends occurring on the input data set, the Mahout clustering algorithm will cluster the data into their corresponding class buckets. At the end of this example, you’ll get to learn how to perform clustering using Mahout.
We will do the same, but using R instead of Mahout. The input dataset is available here.
Here is the script:
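The listing itself did not survive here, so what follows is only a minimal sketch of how this kind of clustering might look in R, not the original script. It assumes base R's `kmeans` with six clusters (one per class) and, to stay self-contained, simulates a small stand-in dataset rather than reading the real one. The helper `make_series`, the simulated shapes, and the file name in the comment are all assumptions made for illustration.

```r
# Sketch only, under stated assumptions; not the original script.
set.seed(42)

# Simulated stand-in for the control chart data (assumed shape):
# 6 classes, 20 series per class, 60 time points per series.
# To use the real dataset instead, something like the following should work
# (file name is an assumption):
#   x <- as.matrix(read.table("synthetic_control.data"))
n_per_class <- 20
n_points <- 60
t_idx <- seq_len(n_points)

# Hypothetical helper: generate one series of the given class.
make_series <- function(class) {
  noise <- rnorm(n_points, mean = 30, sd = 2)
  switch(class,
    normal     = noise,
    cyclic     = noise + 15 * sin(2 * pi * t_idx / 15),
    increasing = noise + 0.4 * t_idx,
    decreasing = noise - 0.4 * t_idx,
    upward     = noise + ifelse(t_idx > 30, 15, 0),
    downward   = noise - ifelse(t_idx > 30, 15, 0)
  )
}

classes <- c("normal", "cyclic", "increasing",
             "decreasing", "upward", "downward")
labels <- rep(classes, each = n_per_class)

# One row per series, one column per time point.
x <- t(sapply(labels, make_series))

# k-means with one cluster per expected class; nstart restarts the
# algorithm from several random initialisations and keeps the best fit.
k <- 6
fit <- kmeans(x, centers = k, nstart = 25)

# Euclidean distance of each series from its assigned centroid.
dist_to_centroid <- sqrt(rowSums((x - fit$centers[fit$cluster, ])^2))

# Cross-tabulate the simulated classes against the cluster assignments.
print(table(labels, fit$cluster))

# Distance from centroid for every series, coloured by cluster.
plot(dist_to_centroid, col = fit$cluster, pch = 19,
     xlab = "Series index", ylab = "Distance from centroid")
```

The cross-tabulation shows how well the clusters line up with the known classes. k-means on raw series may merge classes with similar shapes (for example, a trend and a shift in the same direction), so a perfect one-to-one match should not be expected.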
Here are the graphs produced when we run the above script with no. of clusters,
Distance from centroid