刘晓东

个人信息Personal Information

教授

博士生导师

硕士生导师

性别:男

毕业院校:东北大学

学位:博士

所在单位:控制科学与工程学院

学科:应用数学. 应用数学. 控制理论与控制工程

办公地点:创新园大厦A0620

联系方式:电话: (+86-411) 84726020 (home) (+86-411) 84709380 (Office) 传真: (+86-411) 84707579 手机: (+86-411) 13130042458

电子邮箱:xdliuros@dlut.edu.cn

扫描关注

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

iPseU-Layer: Identifying RNA Pseudouridine Sites Using Layered Ensemble Model

点击次数:

论文类型:期刊论文

发表时间:2020-06-01

发表刊物:INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES

收录刊物:PubMed、SCIE

卷号:12

期号:2

页面范围:193-203

ISSN号:1913-2751

关键字:Pseudouridine; Feature extraction; Ensemble model; Prediction

摘要:Pseudouridine represents one of the most prevalent post-transcriptional RNA modifications. The identification of pseudouridine sites is an essential step toward understanding RNA functions, RNA structure stabilization, translation process, and RNA stability; however, high-throughput experimental techniques remain expensive and time-consuming in lab explorations and biochemical processes. Thus, how to develop an efficient pseudouridine site identification method based on machine learning is very important both in academic research and drug development. Motived by this, we present an effective layered ensemble model designated as iPseU-Layer for identification of RNA pseudouridine sites. The proposed iPseU-Layer approach is essentially based on three different machine learning layers including: feature selection layer, feature extraction and fusion layer, and prediction layer. The feature selection layer reduces the dimensionality, which can be regarded as a data pre-processing stage. The feature extraction and fusion layer utilizes an ensemble method which is implemented through various machine learning algorithms to generate some outputs. The prediction layer applies classic random forest to identify the final results. Furthermore, we systematically conduct the validation experiments using cross-validation tests and independent test with the current state-of-the-art models. The proposed iPseU-Layer provides a promising predictive performance in terms of sensitivity, specificity, accuracy and Matthews correlation coefficient. Collectively, these findings indicate that the framework of iPseU-Layer is a feasible and effective strategy for the prediction of RNA pseudouridine sites.