Identification of Code-Switched Sentences and Words Using Language Modeling Approaches

المؤلفون المشاركون

Yu, Liang-Chih
He, Wei-Cheng
Chien, Wei-Nan
Tseng, Yuen-Hsien

المصدر

Mathematical Problems in Engineering

العدد

المجلد 2013، العدد 2013 (31 ديسمبر/كانون الأول 2013)، ص ص. 1-7، 7ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2013-09-30

دولة النشر

مصر

عدد الصفحات

7

التخصصات الرئيسية

هندسة مدنية

الملخص EN

Globalization and multilingualism contribute to code-switching—the phenomenon in which speakers produce utterances containing words or expressions from a second language.

Processing code-switched sentences is a significant challenge for multilingual intelligent systems.

This study proposes a language modeling approach to the problem of code-switching language processing, dividing the problem into two subtasks: the detection of code-switched sentences and the identification of code-switched words in sentences.

A code-switched sentence is detected on the basis of whether it contains words or phrases from another language.

Once the code-switched sentences are identified, the positions of the code-switched words in the sentences are then identified.

Experimental results show that the language modeling approach achieved an F-measure of 80.43% and an accuracy of 79.01% for detecting Mandarin-Taiwanese code-switched sentences.

For the identification of code-switched words, the word-based and POS-based models, respectively, achieved F-measures of 41.09% and 53.08%.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Yu, Liang-Chih& He, Wei-Cheng& Chien, Wei-Nan& Tseng, Yuen-Hsien. 2013. Identification of Code-Switched Sentences and Words Using Language Modeling Approaches. Mathematical Problems in Engineering،Vol. 2013, no. 2013, pp.1-7.
https://search.emarefa.net/detail/BIM-1011079

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Yu, Liang-Chih…[et al.]. Identification of Code-Switched Sentences and Words Using Language Modeling Approaches. Mathematical Problems in Engineering No. 2013 (2013), pp.1-7.
https://search.emarefa.net/detail/BIM-1011079

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Yu, Liang-Chih& He, Wei-Cheng& Chien, Wei-Nan& Tseng, Yuen-Hsien. Identification of Code-Switched Sentences and Words Using Language Modeling Approaches. Mathematical Problems in Engineering. 2013. Vol. 2013, no. 2013, pp.1-7.
https://search.emarefa.net/detail/BIM-1011079

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1011079