Identification of Code-Switched Sentences and Words Using Language Modeling Approaches
Joint Authors
Yu, Liang-Chih
He, Wei-Cheng
Chien, Wei-Nan
Tseng, Yuen-Hsien
Source
Mathematical Problems in Engineering
Issue
Vol. 2013, Issue 2013 (31 Dec. 2013), pp.1-7, 7 p.
Publisher
Hindawi Publishing Corporation
Publication Date
2013-09-30
Country of Publication
Egypt
No. of Pages
7
Main Subjects
Abstract EN
Globalization and multilingualism contribute to code-switching—the phenomenon in which speakers produce utterances containing words or expressions from a second language.
Processing code-switched sentences is a significant challenge for multilingual intelligent systems.
This study proposes a language modeling approach to the problem of code-switching language processing, dividing the problem into two subtasks: the detection of code-switched sentences and the identification of code-switched words in sentences.
A code-switched sentence is detected on the basis of whether it contains words or phrases from another language.
Once the code-switched sentences are identified, the positions of the code-switched words in the sentences are then identified.
Experimental results show that the language modeling approach achieved an F-measure of 80.43% and an accuracy of 79.01% for detecting Mandarin-Taiwanese code-switched sentences.
For the identification of code-switched words, the word-based and POS-based models, respectively, achieved F-measures of 41.09% and 53.08%.
American Psychological Association (APA)
Yu, Liang-Chih& He, Wei-Cheng& Chien, Wei-Nan& Tseng, Yuen-Hsien. 2013. Identification of Code-Switched Sentences and Words Using Language Modeling Approaches. Mathematical Problems in Engineering،Vol. 2013, no. 2013, pp.1-7.
https://search.emarefa.net/detail/BIM-1011079
Modern Language Association (MLA)
Yu, Liang-Chih…[et al.]. Identification of Code-Switched Sentences and Words Using Language Modeling Approaches. Mathematical Problems in Engineering No. 2013 (2013), pp.1-7.
https://search.emarefa.net/detail/BIM-1011079
American Medical Association (AMA)
Yu, Liang-Chih& He, Wei-Cheng& Chien, Wei-Nan& Tseng, Yuen-Hsien. Identification of Code-Switched Sentences and Words Using Language Modeling Approaches. Mathematical Problems in Engineering. 2013. Vol. 2013, no. 2013, pp.1-7.
https://search.emarefa.net/detail/BIM-1011079
Data Type
Journal Articles
Language
English
Notes
Includes bibliographical references
Record ID
BIM-1011079