A Character Level Based and Word Level Based Approach for Chinese-Vietnamese Machine Translation

Joint Authors

Nguyen, Hien T.
Tran, Phuoc
Dinh, Dien

Source

Computational Intelligence and Neuroscience

Issue

Vol. 2016, Issue 2016 (31 Dec. 2015), pp.1-11, 11 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2016-06-29

Country of Publication

Egypt

No. of Pages

11

Main Subjects

Biology

Abstract EN

Chinese and Vietnamese have the same isolated language; that is, the words are not delimited by spaces.

In machine translation, word segmentation is often done first when translating from Chinese or Vietnamese into different languages (typically English) and vice versa.

However, it is a matter for consideration that words may or may not be segmented when translating between two languages in which spaces are not used between words, such as Chinese and Vietnamese.

Since Chinese-Vietnamese is a low-resource language pair, the sparse data problem is evident in the translation system of this language pair.

Therefore, while translating, whether it should be segmented or not becomes more important.

In this paper, we propose a new method for translating Chinese to Vietnamese based on a combination of the advantages of character level and word level translation.

In addition, a hybrid approach that combines statistics and rules is used to translate on the word level.

And at the character level, a statistical translation is used.

The experimental results showed that our method improved the performance of machine translation over that of character or word level translation.

American Psychological Association (APA)

Tran, Phuoc& Dinh, Dien& Nguyen, Hien T.. 2016. A Character Level Based and Word Level Based Approach for Chinese-Vietnamese Machine Translation. Computational Intelligence and Neuroscience،Vol. 2016, no. 2016, pp.1-11.
https://search.emarefa.net/detail/BIM-1099829

Modern Language Association (MLA)

Tran, Phuoc…[et al.]. A Character Level Based and Word Level Based Approach for Chinese-Vietnamese Machine Translation. Computational Intelligence and Neuroscience Vol. 2016, no. 2016 (2015), pp.1-11.
https://search.emarefa.net/detail/BIM-1099829

American Medical Association (AMA)

Tran, Phuoc& Dinh, Dien& Nguyen, Hien T.. A Character Level Based and Word Level Based Approach for Chinese-Vietnamese Machine Translation. Computational Intelligence and Neuroscience. 2016. Vol. 2016, no. 2016, pp.1-11.
https://search.emarefa.net/detail/BIM-1099829

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1099829