马建军

个人信息Personal Information

教授

硕士生导师

性别:女

毕业院校:大连理工大学

学位:博士

所在单位:外国语学院

学科:外国语言学及应用语言学

办公地点:文科楼107

联系方式:majian@dlut.edu.cn

电子邮箱:majian@dlut.edu.cn

扫描关注

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

面向机器翻译的英语词性标注研究及其应用(英文)

点击次数:

发表时间:2022-10-09

发表刊物:中国通信

所属单位:外国语学院

卷号:9

期号:03

页面范围:58-67

摘要:A hybrid approach to English Part-of-Speech(PoS) tagging with its target application being English-Chinese machine translation in business domain is presented,demonstrating how a present tagger can be adapted to learn from a small amount of data and handle unknown words for the purpose of machine translation.A small size of 998 k English annotated corpus in business domain is built semi-automatically based on a new tagset;the maximum entropy model is adopted,and rule-based approach is used in post-processing.The tagger is further applied in Noun Phrase(NP) chunking.Experiments show that our tagger achieves an accuracy of 98.14%,which is a quite satisfactory result.In the application to NP chunking,the tagger gives rise to 2.21% increase in F-score,compared with the results using Stanford tagger.

备注:新增回溯数据