Hits:
Indexed by:期刊论文
Date of Publication:2010-09-01
Journal:Journal of Information and Computational Science
Included Journals:EI、Scopus
Volume:7
Issue:9
Page Number:1967-1986
ISSN No.:15487741
Abstract:We introduce an improved MI-based phrase alignment approach to match Chinese historical terms with their English translations. The proposed approach is motivated by the findings about the inadequacy of popular machine translation systems like Babelfish and Systran in translating Chinese historical terms, and observations about the discrepancies by established translators and Sinologists in handling these historical terms. Modifications to the basic MI alignment approach, including Bridging, Pruning and Head-word Extension, integrate linguistic knowledge of the translation patterns of Chinese historical terms into statistical alignment algorithm. The proposed framework for alignment achieves more than 17 and 10 points increase in percentage respectively above the base line in two individual corpora and detects a number of valid alternative translations for some top-frequency Chinese historical terms. The discovery also alerts us to the potential impact that different corpus may have for an SMT system from the perspective of readability and efficiency. ? 2010 Binary Information Press.