顾宏
开通时间:..
最后更新时间:..
点击次数:
论文类型:期刊论文
发表时间:2016-05-21
发表刊物:JOURNAL OF THEORETICAL BIOLOGY
收录刊物:PubMed、SCIE、Scopus
卷号:397
期号:397
页面范围:145-150
ISSN号:0022-5193
关键字:Post-translational modification; Phosphoglycerylation; k-spaced amino acid pairs; Fuzzy support vector machine
摘要:As a new type of post-translational modification, lysine phosphoglycerylation plays a key role in regulating glycolytic process and metabolism in cells. Due to the traditional experimental methods are time-consuming and labor-intensive, it is important to develop computational methods to identify the potential phosphoglycerylation sites. However, the prediction performance of the existing phosphoglycerylation site predictor is not satisfactory. In this study, a novel predictor named CKSAAP_PhoglySite is developed to predict phosphoglycerylation sites by using composition of k-spaced amino acid pairs and fuzzy support vector machine. On the one hand, after many aspects of assessments, we find the composition of k-spaced amino acid pairs is more suitable for representing the protein sequence around the phosphoglycerylation sites than other encoding schemes. On the other hand, the proposed fuzzy support vector machine algorithm can effectively handle the imbalanced and noisy problem in phosphoglycerylation sites training dataset. Experimental results indicate that CKSAAP_PhoglySite outperforms the existing phosphoglycerylation site predictor Phogly-PseAAC significantly. A matlab software package for CKSAAP_PhoglySite can be freely downloaded from https://github.cornijuzhe1120/Matlab_Software/blob/master/CKSAAP_PhoglySite_Matlab_Software.zip. (C) 2016 Elsevier Ltd. All rights reserved.