黄德根Huang Degen

(教授)

 博士生导师  硕士生导师
学位:博士
性别:男
毕业院校:大连理工大学
所在单位:计算机科学与技术学院
电子邮箱:huangdg@dlut.edu.cn

论文成果

CHINESE LEXICAL ANALYSIS BASED ON HYBRID MMSM MODEL

发表时间:2019-03-09 点击次数:

论文名称:CHINESE LEXICAL ANALYSIS BASED ON HYBRID MMSM MODEL
论文类型:期刊论文
发表刊物:INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL
收录刊物:SCIE、EI、Scopus
卷号:5
期号:12A
页面范围:4523-4530
ISSN号:1349-4198
关键字:Chinese morphological analysis; MMSM model; CRF; Hidden semi-CRF
摘要:In this paper, we describe a scheme for Chinese word segmentation and POS tagging which integrates the character-based and word-based information in the directed graph generated by the MMSM-model. Word-level information is effective for analysis of known words, while character-level information is useful for analysis of unknown, words. A Hidden semi-CRF model is proposed for the unknown words detection. and POS tagging. The proposed Hidden semi-CRF has two state chains with unequal states which Can perform segmentation and POS tagging of unknown words simultaneously. The hybrid model was evaluated using the test data from SIGHAN-6 and achieved higher F-score than the stage-of-the-art models.
发表时间:2009-12-01