高级工程师
性别: 男
毕业院校: 大连理工大学
学位: 博士
所在单位: 计算机科学与技术学院
学科: 计算机应用技术
办公地点: 创新园大厦D0103房间
联系方式: QQ:2407849530
电子邮箱: xukan@dlut.edu.cn
qq : 2407849530
开通时间: ..
最后更新时间: ..
点击次数:
论文类型: 期刊论文
发表时间: 2012-07-01
发表刊物: Journal of Computational Information Systems
收录刊物: EI、Scopus
卷号: 8
期号: 13
页面范围: 5607-5614
ISSN号: 15539105
摘要: Query expansion technologies are widely used in many information retrieval tasks. Most existing approaches are based on the assumption that the most informative terms can be select from top-retrieved documents in the document context level. However, the query expansion methods for general tasks tend not to be optimal choice for special tasks, such as patent search. In the patent articles, the same word from different context fields may be of different importance for retrieval, since the fields, e.g., title and abstracts, describe the patent from various aspects. So these fields may be used to weight the expansion terms more accurately. In this work, we explore the possibility and potential of text fields to extract more effective expansion terms. In particular, we propose a two-stage ranking approach for query expansion based on document fields. First we select top-retrieved documents by BM25F; second, we explore how to weight the different fields based on their importance to improve the term ranking method for effective expansion terms. Experimental results on three TREC test collections show that the patent retrieval performance can be improved when the term ranking method based on fields is used. ? 2011 by Binary Information Press.