Identification of Code-Switched Sentences and Words Using Language Modeling Approaches

Joint Authors

Yu, Liang-Chih
He, Wei-Cheng
Chien, Wei-Nan
Tseng, Yuen-Hsien

Source

Mathematical Problems in Engineering

Issue

Vol. 2013, Issue 2013 (31 Dec. 2013), pp.1-7, 7 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2013-09-30

Country of Publication

Egypt

No. of Pages

7

Main Subjects

Civil Engineering

Abstract EN

Globalization and multilingualism contribute to code-switching—the phenomenon in which speakers produce utterances containing words or expressions from a second language.

Processing code-switched sentences is a significant challenge for multilingual intelligent systems.

This study proposes a language modeling approach to the problem of code-switching language processing, dividing the problem into two subtasks: the detection of code-switched sentences and the identification of code-switched words in sentences.

A code-switched sentence is detected on the basis of whether it contains words or phrases from another language.

Once the code-switched sentences are identified, the positions of the code-switched words in the sentences are then identified.

Experimental results show that the language modeling approach achieved an F-measure of 80.43% and an accuracy of 79.01% for detecting Mandarin-Taiwanese code-switched sentences.

For the identification of code-switched words, the word-based and POS-based models, respectively, achieved F-measures of 41.09% and 53.08%.

American Psychological Association (APA)

Yu, Liang-Chih& He, Wei-Cheng& Chien, Wei-Nan& Tseng, Yuen-Hsien. 2013. Identification of Code-Switched Sentences and Words Using Language Modeling Approaches. Mathematical Problems in Engineering،Vol. 2013, no. 2013, pp.1-7.
https://search.emarefa.net/detail/BIM-1011079

Modern Language Association (MLA)

Yu, Liang-Chih…[et al.]. Identification of Code-Switched Sentences and Words Using Language Modeling Approaches. Mathematical Problems in Engineering No. 2013 (2013), pp.1-7.
https://search.emarefa.net/detail/BIM-1011079

American Medical Association (AMA)

Yu, Liang-Chih& He, Wei-Cheng& Chien, Wei-Nan& Tseng, Yuen-Hsien. Identification of Code-Switched Sentences and Words Using Language Modeling Approaches. Mathematical Problems in Engineering. 2013. Vol. 2013, no. 2013, pp.1-7.
https://search.emarefa.net/detail/BIM-1011079

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1011079