Improve Biomedical Information Retrieval Using Modified Learning to Rank Methods

Indexed by:Journal article

Date of Publication:2018-11-01

Journal:IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS

Included Journals:SCIE, Scopus

Volume:15

Issue:6

Page Number:1797-1809

ISSN No.:1545-5963

Key Words:Information retrieval; machine learning; supervised learning; text mining

Abstract:In recent years, the number of biomedical articles has increased exponentially, making it difficult for biologists to capture all the needed information manually. Information retrieval technologies, the core of search engines, can address this problem automatically by providing users with the needed information. However, applying these technologies directly to biomedical retrieval is a great challenge because of the abundance of domain-specific terminology. To enhance biomedical retrieval, we propose a novel framework based on learning to rank. Learning to rank is a family of state-of-the-art information retrieval techniques that has proven effective in many information retrieval tasks. In the proposed framework, we tackle the abundance of terminology by constructing ranking models that focus not only on retrieving the most relevant documents but also on diversifying the search results to increase the completeness of the result list for a given query. For model training, we propose two novel document labeling strategies and combine several traditional retrieval models as learning features. We also investigate the usefulness of different learning to rank approaches in our framework. Experimental results on the TREC Genomics datasets demonstrate the effectiveness of our framework for biomedical information retrieval.
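
The idea of combining several traditional retrieval models as learning features can be illustrated with a minimal sketch. This is not the method from the paper: it swaps in a simple pairwise perceptron as a stand-in for a learning to rank algorithm, and it does not cover the proposed labeling strategies or result diversification. The feature layout (three normalized retrieval scores per document, imagined as BM25, TF-IDF, and language-model scores), the graded labels, and all function names are hypothetical.

```python
from itertools import combinations


def train_pairwise_ranker(queries, epochs=50, lr=0.1):
    """Learn linear weights from per-query lists of (features, label) pairs.

    Each feature vector holds scores from several retrieval models
    (hypothetical layout: [bm25, tfidf, lm]); a higher label means more relevant.
    """
    n_features = len(queries[0][0][0])
    w = [0.0] * n_features
    for _ in range(epochs):
        for docs in queries:
            for (xa, ya), (xb, yb) in combinations(docs, 2):
                if ya == yb:
                    continue  # equally relevant pairs carry no ordering signal
                # Order the pair so that `hi` is the more relevant document.
                hi, lo = (xa, xb) if ya > yb else (xb, xa)
                diff = [h - l for h, l in zip(hi, lo)]
                margin = sum(wi * di for wi, di in zip(w, diff))
                if margin <= 0:
                    # Perceptron update: only misordered (or tied) pairs move w.
                    w = [wi + lr * di for wi, di in zip(w, diff)]
    return w


def rank(w, docs):
    """Return document ids sorted by descending linear model score."""
    scored = [(sum(wi * xi for wi, xi in zip(w, x)), doc_id) for doc_id, x in docs]
    return [doc_id for _, doc_id in sorted(scored, reverse=True)]


# Toy training data (made up for illustration): two queries with graded labels.
training = [
    [([0.9, 0.8, 0.7], 2), ([0.5, 0.6, 0.4], 1), ([0.1, 0.2, 0.3], 0)],
    [([0.8, 0.3, 0.9], 2), ([0.2, 0.4, 0.1], 0)],
]
weights = train_pairwise_ranker(training)

# Unseen candidate documents for a new query, again with made-up scores.
candidates = [
    ("d1", [0.2, 0.1, 0.3]),
    ("d2", [0.9, 0.7, 0.8]),
    ("d3", [0.5, 0.5, 0.5]),
]
ranking = rank(weights, candidates)
print(ranking)
```

In practice the learner would be replaced by an established learning to rank algorithm and the features by real retrieval-model scores computed over the collection; the sketch only shows how per-model scores become a feature vector that a ranking model weighs.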
