Hits:
Indexed by:会议论文
Date of Publication:2012-10-04
Included Journals:EI、CPCI-S、Scopus
Page Number:640-647
Key Words:Protein complexes; Naive Bayes; Machine Learning
Abstract:Many computational methods have been applied to identify protein complexes from experimentally obtained protein-protein interaction (PPI) networks. Because of the presence of unreliable interactions in PPI networks, multi-functionality of proteins, and complex connectivity of the PPI network, the task is very challenging. In this study, we tackle the presence of unreliable interactions in protein complex using Genetic-Algorithm Fuzzy Naive Bayes (GAFNB) which takes unreliability into consideration. Many existing methods can provide lots of candidate subgraphs. So we focused on how to classify the protein complexes from the subgraphs by considering the fuzzy attribute of PPI. We experimented with two datasets of size 10,371 and 986, each containing 493 positive protein complexes from MIPS and TAP-MS datasets. We compared the performance of GAFNB with Naive Bayes (NB). Results show that GAFNB performed better which indicates that a fuzzy model is more suitable when unreliability is present. It is necessary to consider the unreliability in identifying protein complexes.