个人信息

林原

(副教授)

 硕士生导师
学位:博士
性别:男
毕业院校:大连理工大学
所在单位:公共管理学院
电子邮箱:zhlin@dlut.edu.cn

论文成果

当前位置: 林原主页 >> 科学研究 >> 论文成果

Tripartite-replicated softmax model for document representations

发表时间:2019-03-11 点击次数:

论文名称:Tripartite-replicated softmax model for document representations
论文类型:会议论文
收录刊物:EI
卷号:10390 LNCS
页面范围:109-121
摘要:Text mining tasks based on machine learning require inputs to be represented as fixed-length vectors, and effective vectors of words, phrases, sentences and even documents may greatly improve the performance of these tasks. Recently, distributed word representations based on neural networks have been demonstrated powerful in many tasks by encoding abundant semantic and linguistic information. However, it remains a great challenge for document representations because of the complex semantic structures in different documents. To meet the challenge, we propose two novel tripartite graphical models for document representations by incorporating word representations into the Replicated Softmax model, and we name the models as Tripartite-Replicated Softmax model (TRPS) and directed Tripartite-Replicated Softmax model (d-TRPS), respectively. We also introduce some optimization strategies for training the proposed models to learn better document representations. The proposed models can capture linear relationships among words and latent semantic information within documents simultaneously, thus learning both linear and nonlinear document representations. We examine the learned document representations in a document classification task and a document retrieval task. Experimental results show that the learned representations by our models outperform the state-of-the-art models in improving the performance of these two tasks. © Springer International Publishing AG 2017.
发表时间:2017-07-13