徐博

个人信息Personal Information

副教授

博士生导师

硕士生导师

性别:女

毕业院校:大连理工大学

学位:博士

所在单位:软件学院、国际信息与软件学院

办公地点:开发区校区综合楼317

电子邮箱:BoXu@dlut.edu.cn

扫描关注

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

Multipath2vec: Predicting Pathogenic Genes via Heterogeneous Network Embedding

点击次数:

论文类型:会议论文

发表时间:2018-01-01

收录刊物:CPCI-S

页面范围:951-956

关键字:Prediction of pathogenic genes; heterogeneous network embedding; disease-causing genes; PPI

摘要:Phenotypically similar diseases have been verified to be in connection with specific genes. Predicting disease genes is important in disease prevention, diagnosis, and treatment. In this work, we focus on this significant issue and propose a disease-causing genes prediction method called Multipath2vec. First, we generate an heterogeneous network called GP-network, which is constructed based on three kinds of relationships between genes and phenotypes, including interactions between genes, correlations between phenotypes, and known gene-phenotype pairs. Then, we propose the multi-path, which is used to guide random walk in GP-network in order to better embedding the network. Finally, we use the achieved vector representation of each protein and phenotype to calculate and rank the similarities between candidate genes and the target phenotype. We implement Multipath2vec as well as two baseline approaches (i.e., CATAPULT, and PRINCE) on whole gene-phenotype data, single-gene gene-phenotype data, and many-genes gene-phenotype data. According to leave-one-out cross validation, Multipath2vec achieves better results than baseline approaches. To our best knowledge, this is the first attempt to use heterogeneous network embedding method in handling pathogenic genes. The outperformed experimental results of Multipath2vec shed light on the possibility of applying network representation methods in the disease-causing genes prediction.