%A Lingxiao Ma;Yi Li;Hancong Tang;Weilai Chi;Depeng Dang %T Parallel Chameleon Clustering Based on MapReduce %0 Journal Article %D 2015 %J JOURNAL OF INFORMATION &COMPUTATIONAL SCIENCE %R 10.12733/jics20105661 %P 2053-2062 %V 12 %N 6 %U {http://manu35.magtech.com.cn/Jwk_ics/CN/abstract/article_2964.shtml} %8 %X With the enlarging volumes of datasets in various areas and the rapid development of distributed technologies, parallel clustering is becoming increasingly important. To cluster large-scale data of various shapes, this paper proposes a parallel Chameleon clustering algorithm. The key idea is using a parallel minimum spanning tree algorithm to generate the initial clusters after obtaining the k-nearest neighbor graph of the original dataset in a parallel way inspired by matrix multiplication, and then using strategies suggested by the primary Chameleon clustering to combine clusters and obtain the final clusters. Finally, we design the parallel Chameleon clustering based on MapReduce. Experiments show that this algorithm is efficient and well-performed.