Towards building a corpus for Palestinian dialect

العناوين الأخرى

نحو بناء مدونة للعامية الفلسطينية

مقدم أطروحة جامعية

Akra, Diyam Fuad

مشرف أطروحة جامعية

Jarrar, Mustafa

أعضاء اللجنة

Habash, Nizar
Arar, Mahdi
Hanani, Abu al-Suud

الجامعة

جامعة بيرزيت

الكلية

كلية الهندسة و التكنولوجيا

القسم الأكاديمي

دائرة علم الحاسوب

دولة الجامعة

فلسطين (الضفة الغربية)

الدرجة العلمية

ماجستير

تاريخ الدرجة العلمية

2015

الملخص الإنجليزي

The number of users of the internet is increasing exponentially every year; most of these users are using social networks, blogs and forums.

When users in the Arab world need communicate with each other, they often use their colloquial dialect Arabic instead of the Modern Standard Arabic (MSA).

To retrieve information about a specific topic, Modern Standard Arabic (MSA) Informational Retrieval will retrieve only MSA relevant data, but maybe there is helpful information published in Dialect Arabic, also if we need to directly translate from Dialect to English, this may be done by translating the Dialect to standard Arabic then to English.

This thesis aims to build an annotated corpus for Palestinian Dialect with Relevant Meta Data.

The proposed methodology includes studying linguistic facts about Palestinian dialect and comparing it with Modern Standard Arabic in terms of morphology, orthography and lexical.

As well as collecting Palestinian written text from different resources, then analyzing and annotating the corpus by using resources designed for IV Egyptian dialect, after that annotating manually a list of words that can’t be analyzed by existing resources, finally start using the existing annotation tool to double check over annotated corpus.

التخصصات الرئيسية

تكنولوجيا المعلومات وعلم الحاسوب

الموضوعات

عدد الصفحات

106

قائمة المحتويات

Table of contents.

Abstract.

Abstract in Arabic.

Chapter One : Introduction.

Chapter Two : Literature review.

Chapter Three : Corpus collection.

Chapter Four : Corpus annotation.

Chapter Five : Conclusion and future work.

References.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Akra, Diyam Fuad. (2015). Towards building a corpus for Palestinian dialect. (Master's theses Theses and Dissertations Master). Birzeit University, Palestine (West Bank)
https://search.emarefa.net/detail/BIM-620217

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Akra, Diyam Fuad. Towards building a corpus for Palestinian dialect. (Master's theses Theses and Dissertations Master). Birzeit University. (2015).
https://search.emarefa.net/detail/BIM-620217

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Akra, Diyam Fuad. (2015). Towards building a corpus for Palestinian dialect. (Master's theses Theses and Dissertations Master). Birzeit University, Palestine (West Bank)
https://search.emarefa.net/detail/BIM-620217

لغة النص

الإنجليزية

نوع البيانات

رسائل جامعية

رقم السجل

BIM-620217