Dalian University of Technology Faculty Homepage Platform: Gu Hong

Gu Hong (顾宏)

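Professor, Doctoral Supervisor, Master's Supervisor
Gender: Male
Alma mater: Zhejiang University
Degree: Ph.D.
Affiliation: School of Control Science and Engineering
Discipline: Pattern Recognition and Intelligent Systems
Office: Room B0715, Innovation Park Building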

Publications


A Method for Improving Protein Localization Prediction from Datasets with Outliers


Posted: 2019-03-11

Paper type: Conference paper

Publication date: 2009-03-30

Indexed in: Scopus, CPCI-S, EI

Pages: 100-105

Abstract: Large-scale genome analysis and drug discovery require an automated prediction method for protein subcellular localization, and Support Vector Machines (SVMs) effectively solve this problem in a supervised manner. However, protein subcellular localization datasets obtained from experiments always contain outliers, which can lead to poor generalization ability and classification accuracy. To address this issue, we first analyzed the influence of Principal Component Analysis (PCA) on classification performance, and then proposed a hybrid method for predicting protein subcellular localization based on the Weighted Support Vector Machine (WSVM) and PCA. Different weights were assigned to different data points, so that the training algorithm could learn the decision boundary according to the relative importance of the data points. After dimension reduction was performed on the datasets, kernel-based possibilistic c-means (KPCM) was chosen to generate the weights for this algorithm, as it generates relatively high values for important data points but low values for outliers. Experimental results on a benchmark dataset are promising and confirm the effectiveness of the proposed method in terms of prediction accuracy.
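
The weighting idea in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation. It assumes a Gaussian (RBF) kernel, fixes each class prototype at the kernel-space mean of the class instead of running the iterative KPCM updates, and uses the standard possibilistic c-means membership with fuzzifier m = 2, that is u = 1 / (1 + d²/η). The function names, the `gamma` value, the choice of η as the mean squared distance, and the toy data are all hypothetical.

```python
import math


def rbf_kernel(a, b, gamma):
    """Gaussian (RBF) kernel between two equal-length vectors."""
    sq_dist = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-gamma * sq_dist)


def typicality_weights(points, gamma=0.5):
    """Possibilistic-style weights for the points of ONE class.

    Assumption: the prototype is the kernel-space mean of the class,
    so this is a one-step simplification, not full iterative KPCM.
    """
    n = len(points)
    gram = [[rbf_kernel(p, q, gamma) for q in points] for p in points]
    mean_all = sum(sum(row) for row in gram) / (n * n)

    # Squared kernel-space distance from each point to the class mean:
    # K(x, x) - 2 * mean_j K(x, x_j) + mean_jk K(x_j, x_k)
    dist2 = [
        max(gram[i][i] - 2.0 * sum(gram[i]) / n + mean_all, 0.0)
        for i in range(n)
    ]

    # Scale parameter eta: mean squared distance (an assumed choice).
    eta = sum(dist2) / n
    if eta <= 0.0:
        return [1.0] * n  # all points coincide, so all are equally typical

    # PCM membership with fuzzifier m = 2.
    return [1.0 / (1.0 + d / eta) for d in dist2]


# Toy class: four tightly grouped points and one far-away outlier.
cluster = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (3.0, 3.0)]
weights = typicality_weights(cluster)
for point, w in zip(cluster, weights):
    print(point, round(w, 3))
```

In this toy example the isolated point receives the smallest weight. In a pipeline like the one described, such weights would be computed per class on the PCA-reduced features and then passed to a weighted SVM as per-sample weights, for example through the `sample_weight` argument of scikit-learn's `SVC.fit`.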
