大连理工大学主页平台管理系统李秀英 Extracting Historical Terms Based on Aligned Chinese-English Parallel Corpora Home

Current position: Home >> Scientific Research >> Paper Publications

Extracting Historical Terms Based on Aligned Chinese-English Parallel Corpora

Release Time:2019-03-11 Hits:

Indexed by: Conference Paper

Date of Publication: 2009-09-24

Included Journals: Scopus、CPCI-S、EI

Page Number: 296-301

Key Words: Historical term; extraction; parallel corpora; Chinese historical classics

Abstract: This paper examines the feasibility of implementing statistic-oriented term extraction and evaluation methods in extracting historical terms from aligned parallel corpora of Chinese historical classics and their translations. It proposes to take transliteration as anchor points to establish sentence-level alignment. It also investigates the approach to extract term translation pairs based on 4000 parallel sentences or segments of sentences from the corpora of the Chinese historical classic Shi Ji (Records of the Historian) and its English translations by two well-known translators. The experimental results indicate that the statistically sound algorithm can successfully extract those terms whose English translations are consistent throughout the corpus and those transliterated pairs, but fails to extract the translations of those terms that are translated differently by the two translators although the translations may be equally qualified in terms of their usage in the English language. The algorithm also fails to extract the top frequency terms which are ambiguous in meaning due to changes of its part of speech. Therefore, this paper suggests insights gained from the linguistic and translation studies perspectives can be integrated with the statistic measurements to improve the extraction and validating results.

Prev One:Writing Shi Ji into world literature —— On Burton Watson’s translation of Shi Ji (Records of the Historian)

Next One:术语与机器翻译——实验结果分析与术语数据库的构建

Home

Scientific Research

Teaching Research

Awards and Honours

Enrollment Information

Student Information

My Album

Blog

Extracting Historical Terms Based on Aligned Chinese-English Parallel Corpora