Huang Degen (黄德根)

(Professor)

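Doctoral Supervisor, Master's Supervisor
Degree: Doctorate
Gender: Male
Alma mater: Dalian University of Technology
Affiliation: School of Computer Science and Technology
Email: huangdg@dlut.edu.cn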

Publications

A Three-Layered Collocation Extraction Tool and Its Application in China English Studies

Publication date: 2019-03-11

Paper title: A Three-Layered Collocation Extraction Tool and Its Application in China English Studies
Paper type: Conference paper
Indexed in: EI, CPCI-S, Scopus
Volume: 9427
Pages: 38-49
Keywords: Collocation extraction; Dependency relation; China English
Abstract: We design a three-layered collocation extraction tool that integrates syntactic and semantic knowledge, and we apply it in China English studies. The tool first extracts peripheral collocations in the frequency layer from dependency triples, then extracts semi-peripheral collocations in the syntactic layer using association measures, and finally extracts core collocations in the semantic layer with a similar-word thesaurus. The syntactic constraints filter out much of the noise from surface co-occurrences, and the semantic constraints are effective in identifying the very "core" collocations. The tool is applied to automatically extract collocations from a large corpus of China English that we compiled, in order to explore how China English, as a variety of English, is nativized. We then analyze the similarities and differences among the typical China English collocations of a group of verbs. The tool and results can be applied in the compilation of language resources for Chinese-English translation and in corpus-based China studies.
Publication date: 2015-11-13
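
The three layers described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' tool. The function name `extract_layers`, the thresholds `min_freq` and `min_pmi`, the choice of pointwise mutual information as the association measure, and the thesaurus format are all assumptions made for illustration. The semantic-layer criterion used here (a collocation counts as "core" when replacing its dependent with a similar word does not give another frequent triple) is likewise an assumed reading of the abstract, which does not spell out the rule.

```python
import math
from collections import Counter


def extract_layers(triples, thesaurus, min_freq=2, min_pmi=0.5):
    """Return (peripheral, semi_peripheral, core) sets of dependency triples.

    triples:   iterable of (head, relation, dependent) tuples.
    thesaurus: dict mapping a word to a set of similar words
               (hypothetical format, not taken from the paper).
    """
    counts = Counter(triples)
    total = sum(counts.values())

    # Frequency layer: keep triples observed at least min_freq times.
    peripheral = {t for t, c in counts.items() if c >= min_freq}

    # Marginal counts for the head side and the dependent side, per relation.
    head_counts, dep_counts = Counter(), Counter()
    for (head, rel, dep), c in counts.items():
        head_counts[(head, rel)] += c
        dep_counts[(rel, dep)] += c

    def pmi(triple):
        # Pointwise mutual information, an assumed association measure.
        head, rel, dep = triple
        return math.log2(
            counts[triple] * total
            / (head_counts[(head, rel)] * dep_counts[(rel, dep)])
        )

    # Syntactic layer: keep frequent triples whose PMI clears the threshold.
    semi_peripheral = {t for t in peripheral if pmi(t) >= min_pmi}

    def substitutable(triple):
        # True if swapping in a similar word gives another frequent triple.
        head, rel, dep = triple
        return any(
            (head, rel, alt) in peripheral for alt in thesaurus.get(dep, ())
        )

    # Semantic layer (assumed criterion): keep non-substitutable triples.
    core = {t for t in semi_peripheral if not substitutable(t)}
    return peripheral, semi_peripheral, core


# Toy data, invented for illustration only.
demo = (
    [("build", "dobj", "socialism")] * 4
    + [("eat", "dobj", "apple")] * 3
    + [("eat", "dobj", "pear")] * 3
    + [("eat", "dobj", "thing")] * 2
    + [("build", "dobj", "thing")] * 2
    + [("see", "dobj", "house")]
)
similar = {"apple": {"pear"}, "pear": {"apple"}, "socialism": {"communism"}}

peripheral, semi_peripheral, core = extract_layers(demo, similar)
print(sorted(core))
```

On this toy data the frequency layer drops the single `see house` triple, the association threshold drops the two triples with the generic object `thing`, and the substitution check drops `eat apple` and `eat pear` because each can be swapped for the other. A real pipeline would obtain the triples from a dependency parser and would likely use a richer association measure and thesaurus.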