黄德根Huang Degen

(教授)

 博士生导师  硕士生导师
学位:博士
性别:男
毕业院校:大连理工大学
所在单位:计算机科学与技术学院
电子邮箱:huangdg@dlut.edu.cn

论文成果

Automatic part-of-speech tagging for Oromo language using Maximum Entropy Markov Model (MEMM)

发表时间:2019-03-12 点击次数:

论文名称:Automatic part-of-speech tagging for Oromo language using Maximum Entropy Markov Model (MEMM)
论文类型:期刊论文
发表刊物:Journal of Information and Computational Science
收录刊物:EI、Scopus
卷号:11
期号:10
页面范围:3319-3334
ISSN号:15487741
摘要:The problem of Part-of-speech tagging (POS tagging) for natural language processing task or computational linguistics is inevitable for every natural language of mankind. In this paper, we present experimental results on one of the state-of-the-art probabilistic model for sequence classification, Maximum Entropy Markov Model (MEMM), for tagging Oromo language. This model assigns the correct part-of-speech tag to each word or token of the sentence, considering many features and contexts. We used a MEMM and it was found to be the best way to estimate word classes of Oromo text. To implement the model, experiments were conducted on a manually annotated corpus of 452 sentences (total of 6094 words) of Oromo language. Experimental results show that the new algorithm performs well with accuracy of 93.01% evaluated by tenfold cross validation. By the result of this paper it can be generalized that this modelling technique, MEMM, has shown some advantages over Hidden Markov Models for sequence tagging since it offers increased freedom in choosing features to represent observations for POS tagging of oromo language. 1548-7741/Copyright ? 2014 Binary Information Press.
发表时间:2014-07-01