大连理工大学主页平台管理系统周惠巍 A Multistage Gene Normalization System Integrating Multiple Effective Methods Home

Current position: Home >> Scientific Research >> Paper Publications

A Multistage Gene Normalization System Integrating Multiple Effective Methods

Release Time:2019-03-09 Hits:

Indexed by: Journal Article

Date of Publication: 2013-12-12

Journal: PLOS ONE

Included Journals: Scopus、PubMed、SCIE

Volume: 8

Issue: 12

Page Number: e81956

ISSN: 1932-6203

Abstract: Gene/protein recognition and normalization is an important preliminary step for many biological text mining tasks. In this paper, we present a multistage gene normalization system which consists of four major subtasks: pre-processing, dictionary matching, ambiguity resolution and filtering. For the first subtask, we apply the gene mention tagger developed in our earlier work, which achieves an F-score of 88.42% on the BioCreative II GM testing set. In the stage of dictionary matching, the exact matching and approximate matching between gene names and the EntrezGene lexicon have been combined. For the ambiguity resolution subtask, we propose a semantic similarity disambiguation method based on Munkres' Assignment Algorithm. At the last step, a filter based on Wikipedia has been built to remove the false positives. Experimental results show that the presented system can achieve an F-score of 90.1%, outperforming most of the state-of-the-art systems.

Prev One:基于电气故障探究的电路理论教学法探究

Next One:基于句法结构约束的模糊限制信息范围检测

Home

Scientific Research

Teaching Research

Awards and Honours

Enrollment Information

Student Information

My Album

Blog

A Multistage Gene Normalization System Integrating Multiple Effective Methods