
Integrating PPI datasets with the PPI data from biomedical literature for protein complex detection


Indexed by:Journal Article

Date of Publication:2014-10-22

Journal:BMC MEDICAL GENOMICS

Included Journals:SCIE, Scopus

Volume:7

Issue:2

ISSN No.:1755-8794

Abstract:Background: Protein complexes are important for understanding the principles of cellular organization and function. High-throughput experimental techniques have produced a large number of protein-protein interactions (PPIs), making it possible to predict protein complexes from protein-protein interaction networks. In addition, the rapidly growing biomedical literature provides a large and readily available source of interaction data, which can be integrated into the protein network for better complex detection performance.
   Methods: We present an approach for integrating PPI datasets with PPI data from the biomedical literature for protein complex detection. The approach applies a natural language processing system, PPIExtractor, to extract PPI data from the biomedical literature. These data are then integrated into the PPI datasets for complex detection.
   Results: Experimental results for the state-of-the-art complex detection method ClusterONE on five yeast PPI datasets verify our method's effectiveness: compared with the original PPI datasets, the new networks achieve average improvements of 3.976 and 5.416 percentage points in the maximum matching ratio (MMR) using the MIPS and SGD gold standards, respectively. In addition, our approach also proves effective for three other complex detection algorithms proposed in recent years, i.e. CMC, COACH and RRW.
   Conclusions: The rapidly growing biomedical literature provides a significantly large, readily available and relatively accurate source of interaction data, which can be integrated into the protein network for better protein complex detection performance.
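   Illustration: The integration step described under Methods can be sketched roughly as follows. This is a minimal sketch, not the authors' implementation: it assumes both sources are available as unweighted, undirected protein pairs and that integration is a plain set union, which the abstract does not specify. The function names `normalize` and `integrate` and the protein labels `P1` to `P4` are hypothetical placeholders, not real data.

```python
from typing import FrozenSet, Iterable, Set, Tuple

Edge = FrozenSet[str]  # an undirected interaction between two proteins


def normalize(pairs: Iterable[Tuple[str, str]]) -> Set[Edge]:
    """Turn (a, b) pairs into undirected edges, dropping self-loops and duplicates."""
    edges: Set[Edge] = set()
    for a, b in pairs:
        if a != b:
            edges.add(frozenset((a, b)))
    return edges


def integrate(dataset_pairs: Iterable[Tuple[str, str]],
              literature_pairs: Iterable[Tuple[str, str]]) -> Set[Edge]:
    """Assumed integration rule: union of the two undirected edge sets."""
    return normalize(dataset_pairs) | normalize(literature_pairs)


# Hypothetical placeholder interactions (not taken from any real dataset).
experimental = [("P1", "P2"), ("P2", "P1"), ("P3", "P1")]
literature = [("P1", "P3"), ("P4", "P1")]

network = integrate(experimental, literature)
for edge in sorted(sorted(e) for e in network):
    print(edge)
```

   The merged edge set could then be passed to a complex detection method such as ClusterONE. Whether literature-derived interactions should carry a weight or confidence score is a design choice this sketch leaves open.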
