location: Current position: Home >> Scientific Research >> Paper Publications

iLM-2L: A two-level predictor for identifying protein lysine methylation sites and their methylation degrees by incorporating K-gap amino acid pairs into Chou's general PseAAC

Hits:

Indexed by:Journal Papers

Date of Publication:2015-11-21

Journal:JOURNAL OF THEORETICAL BIOLOGY

Included Journals:PubMed、SCIE、Scopus

Volume:385

Issue:385

Page Number:50-57

ISSN No.:0022-5193

Key Words:K-spaced amino acid pair; Multi-label classification; Post-translational modification; Support vector machine

Abstract:As one of the most critical post-translational modifications, lysine methylation plays a key role in regulating various protein functions. In order to understand the molecular mechanism of lysine methylation, it is important to identify lysine methylation sites and their methylation degrees accurately. As the traditional experimental methods are time-consuming and labor-intensive, several computational methods have been developed for the identification of methylation sites. However, the prediction accuracy of existing computational methods is still unsatisfactory. Moreover, they are only focused on predicting whether a query lysine residue is a methylation site, without considering its methylation degrees. In this paper, a novel two-level predictor named iLM-2L is proposed to predict lysine methylation sites and their methylation degrees using composition of k-spaced amino acid pairs feature coding scheme and support vector machine algorithm. The 1st level is to identify whether a query lysine residue is a methylation site, and the 2nd level is to identify which methylation degree(s) the query lysine residue belongs to if it has been predicted as a methyllysine site in the 1st level identification. The iLM-2L achieves a promising performance with a Sensitivity of 76.46%, a Specificity of 91.90%, an Accuracy of 85.31% and a Matthew's correlation coefficient of 69.94% for the 1st level as well as a Precision of 84.81%, an accuracy of 79.35%, a recall of 80.83%, an Absolute_Ture of 73.89% and a Hamming_loss of 15.63% for the 2nd level in jackknife test. As illustrated by independent test, the performance of iLM-2L outperforms other existing lysine methylation site predictors significantly. A matlab software package for iLM-2L can be freely downloaded from https://github.com/juzhe1120/Matlab_Software/blob/master/iLM-2L_Matlab_Softwaresar. (C) 2015 Published by Elsevier Ltd.

Pre One:A Remote Engine Health Management System based on Mobile cloud computing

Next One:基于变分高斯过程模型的快速核偏标记学习算法