黄德根Huang Degen

教授

 博士生导师  硕士生导师
学位:博士
性别:男
毕业院校:大连理工大学
所在单位:计算机科学与技术学院
Email :

论文成果

A Three-Layered Collocation Extraction Tool and Its Application in China English Studies

发布时间:2019-03-11 点击次数:

论文类型:会议论文
收录刊物:Scopus、CPCI-S、EI
卷号:9427
页面范围:38-49
关键字:Collocation extraction; Dependency relation; China English
摘要:We design a three-layered collocation extraction tool by integrating syntactic and semantic knowledge and apply it in China English studies. The tool first extracts peripheral collocations in the frequency layer from dependency triples, then extracts semi-peripheral collocations in the syntactic layer by association measures, and last extracts core collocations in the semantic layer with a similar word thesaurus. The syntactic constraints filter out much noise from surface co-occurrences, and the semantic constraints are effective in identifying the very "core" collocations. The tool is applied to automatically extract collocations from a large corpus of China English we compile to explore how China English as a variety of English is nativilized. Then we analyze similarities and differences of the typical China English collocations of a group of verbs. The tool and results can be applied in the compilation of language resources for Chinese-English translation and corpus-based China studies.