南晓莉

个人信息Personal Information

副教授

硕士生导师

性别:女

毕业院校:山东大学

学位:硕士

所在单位:金融与会计研究所

学科:金融学. 会计学

办公地点:大连理工大学经济与管理学院D座466室

联系方式:0411-84706106

电子邮箱:nanxiaoli@dlut.edu.cn

扫描关注

论文成果

当前位置: 南晓莉 >> 科学研究 >> 论文成果

A latent discriminative variable model for automatic identification of Chinese base phrases

点击次数:

论文类型:期刊论文

发表时间:2010-07-01

发表刊物:Journal of Information and Computational Science

收录刊物:EI、Scopus

卷号:7

期号:7

页面范围:1535-1541

ISSN号:15487741

摘要:In the fields of natural language processing such as information processing and machine translation, recognizing simple and non-recursive Chinese base phrases is an important task. In stead of rule-based model, we adopt the statistical machine learning method, newly proposed Latent semi-CRF model to solve the Chinese base phrase chunking problem. The Chinese base phrases could be treated as the sequence labeling problem, which involve the prediction of a class label for each frame in an unsegmented sequence. The Chinese base phrases have sub-structures which could not be observed in training data. Latent semi-CRF, which incorporates the advantages of Latent Dynamic Conditional Random Fields and semi-CRF that model the sub-structure of a class sequence and learn dynamics between class labels, in detecting the Chinese base phrases. Our results demonstrate that the latent dynamic discriminative model compares favorably to Support Vector Machines, Maximum Entropy Model, and Conditional Random Fields (including LDCRF and semi-CRF) on Chinese base phrases chunking. Copyright ? 2010 Binary Information Press.