CAO Qi-min, GUO Qiao, WU Xiang-hua. Similarity matrix-based K-means algorithm for text clustering[J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2015, 24(4): 566-572. DOI: 10.15918/j.jbit1004-0579.201524.0421

Similarity matrix-based K-means algorithm for text clustering

  • K-means is one of the most widely used algorithms in cluster analysis. To address the problems caused by the random selection of initial center points in the traditional algorithm, this paper proposes an improved K-means algorithm based on the similarity matrix. The improved algorithm avoids random selection of the initial center points, so it provides effective initial points for the clustering process and reduces the fluctuation in clustering results caused by the choice of initial points, yielding better clustering quality. Experimental results show that the improved K-means algorithm achieves a greatly improved F-measure and more stable clustering results.
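The abstract does not say how the initial centers are derived from the similarity matrix, so the following is only a minimal sketch of one plausible approach, not the authors' procedure. It assumes cosine similarity between document vectors and a farthest-first rule: the first center is the document most similar to all others, and each later center is the document least similar to the centers already chosen. The function names `cosine_similarity_matrix`, `pick_initial_centers`, and `kmeans` are hypothetical.

```python
import numpy as np


def cosine_similarity_matrix(X):
    """Pairwise cosine similarity between the rows of X (one row per document)."""
    X = np.asarray(X, dtype=float)
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    norms[norms == 0] = 1.0  # avoid division by zero for empty documents
    Xn = X / norms
    return Xn @ Xn.T


def pick_initial_centers(S, k):
    """Deterministically pick k document indices from similarity matrix S.

    ASSUMPTION: this farthest-first rule is illustrative only; the paper's
    actual selection rule is not given in the abstract.
    """
    # First center: the document with the largest total similarity.
    centers = [int(np.argmax(S.sum(axis=1)))]
    while len(centers) < k:
        # Similarity of every document to its most similar chosen center.
        closest = S[:, centers].max(axis=1)
        closest[centers] = np.inf  # never pick the same document twice
        centers.append(int(np.argmin(closest)))
    return centers


def kmeans(X, k, n_iter=100):
    """Plain K-means (squared Euclidean distance) with non-random initialization."""
    X = np.asarray(X, dtype=float)
    S = cosine_similarity_matrix(X)
    centers = X[pick_initial_centers(S, k)].copy()
    labels = None
    for _ in range(n_iter):
        # Assign each document to its nearest center.
        dist = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        new_labels = dist.argmin(axis=1)
        if labels is not None and np.array_equal(new_labels, labels):
            break  # assignments stopped changing
        labels = new_labels
        # Recompute each center as the mean of its members.
        for j in range(k):
            members = X[labels == j]
            if len(members):
                centers[j] = members.mean(axis=0)
    return labels, centers
```

Because nothing in this sketch is random, repeated runs on the same input give the same clustering, which is the kind of stability the abstract describes. Whether this rule would also improve the F-measure is not something the sketch establishes.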
