杨亮

个人信息Personal Information

副教授

硕士生导师

性别:男

毕业院校:大连理工大学

学位:博士

所在单位:计算机科学与技术学院

办公地点:创新园大厦A1028

联系方式:liang@dlut.edu.cn

电子邮箱:liang@dlut.edu.cn

扫描关注

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

Cross-domain Sentiment Classification via Constructing Semantic Correlation

点击次数:

论文类型:期刊论文

发表时间:2017-01-01

发表刊物:IAENG International Journal of Computer Science

收录刊物:Scopus、EI

卷号:44

期号:2

页面范围:172-179

摘要:Cross-domain sentiment classification trains robust classifiers across domains with the help of source domain labeled data. Sentiment is expressed differently in different domains. Sentiment terms that occur in a source domain may not appear in a different target domain. Such feature mismatches hinder cross-domain sentiment classification. Previous studies have addressed this problem by constructing a common feature representation or subspace; however, they have not considered semantic correlations between features. In this paper, we propose a cross-domain semantic correlation auto-correspondence method (CSCW) by capturing similar semantic features from different domains. First, our method uses sentiment-invariance words as features by considering their properties as strong sentiment indicators and their invariance across domains. Second, we extracted the top-N pivot features using a common frequency among source and target domains. These pivot features can then be employed to find semantically similar sentiment features from both domains. Third, for each pivot feature, by calculating the semantic similarity between non-pivot features and pivot features from either domain with the help of Word2Vec, we construct similar-pivot feature pairs that express similar sentiments but in different representations in either domain. Finally, we transform these pairs to align similar sentiment feature representations. This process avoids feature mismatches and reduces sentiment discrepancies between domains. The experimental results from testing on 12 source-target domain pairs of Amazon product reviews demonstrate that our method significantly outperforms previous approaches in sentiment classification.