Submit Manuscript  

Article Details


ML-rRBF-ECOC: A Multi-Label Learning Classifier for Predicting Protein Subcellular Localization with Both Single and Multiple Sites

[ Vol. 16 , Issue. 5 ]

Author(s):

Guo-Sheng Han and Zu-Guo Yu*   Pages 359 - 365 ( 7 )

Abstract:


Background: The subcellular localization of a protein is closely related with its functions and interactions. More and more evidences show that proteins may simultaneously exist at, or move between, two or more different subcellular localizations. Therefore, predicting protein subcellular localization is an important but challenging problem.

Observation: Most of the existing methods for predicting protein subcellular localization assume that a protein locates at a single site. Although a few methods have been proposed to deal with proteins with multiple sites, correlations between subcellular localization are not efficiently taken into account. In this paper, we propose an integrated method for predicting protein subcellular localizations with both single site and multiple sites.

Methods: Firstly, we extend the Multi-Label Radial Basis Function (ML-RBF) method to the regularized version, and augment the first layer of ML-RBF to take local correlations between subcellular localization into account. Secondly, we embed the modified ML-RBF into a multi-label Error-Correcting Output Codes (ECOC) method in order to further consider the subcellular localization dependency. We name our method ML-rRBF-ECOC. Finally, the performance of ML-rRBF-ECOC is evaluated on three benchmark datasets.

Results: The results demonstrate that ML-rRBF-ECOC has highly competitive performance to the related multi-label learning method and some state-of-the-art methods for predicting protein subcellular localizations with multiple sites. Considering dependency between subcellular localizations can contribute to the improvement of prediction performance.

Conclusion: This also indicates that correlations between different subcellular localizations really exist. Our method at least plays a complementary role to existing methods for predicting protein subcellular localizations with multiple sites.

Keywords:

Subcellular localization, multiple sites, multi-label radial basis function, error-correcting output codes, multi-label, label correlations.

Affiliation:

Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Hunan 411105, Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Hunan 411105

Graphical Abstract:



Read Full-Text article