Semi-supervised spectral clustering using shared nearest neighbor for data with different shape and density

Gao YouSheng, Siti Khatijah Nor Abdul Rahim, Raseeda Hamzah, Li Ang, Raihah Aminuddin

Abstract


In the absence of supervisory information in spectral clustering algorithms, it is difficult to construct suitable similarity graphs for data with complex shapes and varying densities. To address this issue, this paper proposes a semisupervised spectral clustering algorithm based on shared nearest neighbor (SNN). The proposed algorithm combines the idea of semi-supervised clustering, adding SNN to the calculation of the distance matrix, and using pairwise constraint information to find the relationship between two data points, while providing a portion of supervised information. Comparative experiments were conducted on artificial data sets and University of California Irvine machine learning repository datasets. The experimental results show that the proposed algorithm achieves better clustering results compared to traditional K-means and spectral clustering algorithms.


Keywords


K-means; Machine learning; Pairwise constraints; Semi-supervised clustering; Shared nearest neighbor; Spectral clustering

Full Text:

PDF


DOI: http://doi.org/10.11591/ijai.v13.i2.pp2283-2290

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938 
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

View IJAI Stats