Feature selection for human membrane protein type classification using filter methods

Glenda Anak Kaya, Nor Ashikin Mohamad Kamal

Abstract


As the number of protein sequences in the database is increasing, effective and efficient techniques are needed to make these data meaningful.  These protein sequences contain redundant and irrelevant features that cause lower classification accuracy and increase the running time of the computational algorithm. In this paper, we select the best features using Minimum Redundancy Maximum Relevance(mRMR) and Correlation-based feature selection(CFS) methods. Two datasets of human membrane protein are used, S1 and S2.  After the features have been selected by mRMR and CFS, K-Nearest Neighbor(KNN) and Support Vector Machine(SVM) classifiers are used to classify these membrane proteins. The performance of these techniques is measured using accuracy, specificity and sensitivity. and F-measure. The proposed algorithm managed to achieve 76% accuracy for S1 and 73% accuracy for S2. Finally, our proposed methods present competitive results when compared with the previous works on membrane protein classification.

Keywords


CFS; Feature selection; KNN; Membrane protein; mRMR



DOI: http://doi.org/10.11591/ijai.v8.i4.pp%25p
Total views : 43 times

Refbacks

  • There are currently no refbacks.


Copyright (c) 2019 Institute of Advanced Engineering and Science

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

View IJAI Stats

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.