高级工程师
性别: 男
毕业院校: 大连理工大学
学位: 博士
所在单位: 计算机科学与技术学院
学科: 计算机应用技术
办公地点: 创新园大厦D0103房间
联系方式: QQ:2407849530
电子邮箱: xukan@dlut.edu.cn
qq : 2407849530
开通时间: ..
最后更新时间: ..
点击次数:
论文类型: 会议论文
发表时间: 2017-01-01
收录刊物: EI、CPCI-S
卷号: Part F131841
页面范围: 2395-2398
关键字: Learning to rank; autoencoders; semi-supervised learning
摘要: Learning to rank utilizes machine learning methods to solve ranking problems by constructing ranking models in a supervised way, which needs fixed-length feature vectors of documents as inputs, and outputs the ranking models learned by iteratively reducing the pre-defined ranking loss. The document features are always extracted based on classic textual statistics, and different features contribute differently to ranking performance. Given that well-defined features would contribute more to the retrieval performance, we investigate the usage of autoencoders to enrich the feature representations of documents. Autoencoders, as basic building blocks of deep neural networks, have been successfully used in many text mining tasks for generating effective features. To enrich the feature space for learning to rank, we introduce supervision into the loss functions of autoencoders. Specifically, we first train a linear ranking model on the training data, and then incorporate the learned weights into the reconstruction costs of an autoencoder. Meanwhile, we accumulate the costs of documents for a given query with query level constraints for producing more useful features. We evaluate the effectiveness of our model on three LETOR datasets, and show that our model can generate effective document features to improve the retrieval performance.