location: Current position: Home >> Scientific Research >> Paper Publications

A Three-Layered Collocation Extraction Tool and Its Application in China English Studies

Hits:

Indexed by:会议论文

Date of Publication:2015-11-13

Included Journals:EI、CPCI-S、Scopus

Volume:9427

Page Number:38-49

Key Words:Collocation extraction; Dependency relation; China English

Abstract:We design a three-layered collocation extraction tool by integrating syntactic and semantic knowledge and apply it in China English studies. The tool first extracts peripheral collocations in the frequency layer from dependency triples, then extracts semi-peripheral collocations in the syntactic layer by association measures, and last extracts core collocations in the semantic layer with a similar word thesaurus. The syntactic constraints filter out much noise from surface co-occurrences, and the semantic constraints are effective in identifying the very "core" collocations. The tool is applied to automatically extract collocations from a large corpus of China English we compile to explore how China English as a variety of English is nativilized. Then we analyze similarities and differences of the typical China English collocations of a group of verbs. The tool and results can be applied in the compilation of language resources for Chinese-English translation and corpus-based China studies.

Pre One:基于简单名词短语的汉语介词短语识别研究

Next One:Exploring Recurrent Neural Networks to Detect Named Entities from Biomedical Text