Current position: Home >> Scientific Research >> Paper Publications

Improved word alignment in patent domain

Release Time:2019-03-11  Hits:

Indexed by: Conference Paper

Date of Publication: 2011-11-27

Included Journals: Scopus、EI

Page Number: 209-213

Abstract: This paper presents a new method for word alignment in patent domain which incorporates both generative and discriminative models. In this framework, the advantage of generative model that can learn large numbers of parameters from a sentence-aligned parallel corpus automatically in a unsupervised way can be kept, as well as get an improvement through discriminative models which can deploy various features in a supervised way. Even with only 300 word-aligned Chinese-English sentence pairs, incorporates with a 1M parallel Chinese-English patent sentences released by NTCIR9, experiments show that our method can get a promising performance. ? 2011 IEEE.

Prev One:Mining english-Chinese named entity pairs from comparable corpora

Next One:An English part-of-speech tagger for machine translation in business domain