顾宏
开通时间:..
最后更新时间:..
点击次数:
论文类型:期刊论文
发表时间:2010-11-01
发表刊物:INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL
收录刊物:SCIE、EI、Scopus
卷号:6
期号:11
页面范围:5093-5103
ISSN号:1349-4198
关键字:Protein subnuclear location; Elman networks ensemble; Amphiphilic PseAA ammo acid composition; Re-substitution test; Jackkinfe test
摘要:Knowledge of nuclear-protein localizations plays a very important role in understanding the biochemical processes of the nucleus With the avalanche of protein sequences generated in the post-qenomic era, an automated method is badly needed to annotate the subnuelear locations of numerous newly found nuclear protein sequences in a short time In this paper, a novel approach is developed for predicting the protein subnuclear locations Firstly, a powerful ensemble classifier based on the Elman networks is proposed Secondly, the protein samples are represented by amphiphilic pseudo amino acid (PseAA) composition, which can incorporate a considerable amount of sequence-order effects Thirdly, six differ cut algorithms are adopted to consider the diversity of the base ensemble, and 18 Elman networks are subsequently obtained by using three different node numbers of neurons in the hidden layers Lastly, as a demonstration, identifications are performed for 9 subnuclear locations in 714 nuclear proteins The accuracy rates, obtained in both a re-substitution test and a jackknife test, are significantly higher than those achieved by other classifiers It is anticipated that the proposed approach may become a useful tool to reduce the huge gap between the number of gene sequences in databases and the number of gene products which have been functionally characterized