Towards building a corpus for Palestinian dialect

Other Title(s)

نحو بناء مدونة للعامية الفلسطينية

Dissertant

Akra, Diyam Fuad

Thesis advisor

Jarrar, Mustafa

Committee Members

Habash, Nizar
Arar, Mahdi
Hanani, Abu al-Suud

University

Birzeit University

Faculty

Faculty of Engineering and Technology

Department

Department of Computer Science

University Country

Palestine (West Bank)

Degree

Master

Degree Date

2015

English Abstract

The number of internet users is increasing exponentially every year, and most of these users use social networks, blogs, and forums.

When users in the Arab world need to communicate with each other, they often use their colloquial Arabic dialect instead of Modern Standard Arabic (MSA).

When retrieving information about a specific topic, an MSA information retrieval system returns only MSA-relevant data, even though helpful information may also be published in dialectal Arabic. Similarly, translating directly from a dialect to English may be done by first translating the dialect to standard Arabic and then to English.

This thesis aims to build an annotated corpus for the Palestinian dialect with relevant metadata.

The proposed methodology includes studying linguistic facts about the Palestinian dialect and comparing it with Modern Standard Arabic in terms of morphology, orthography, and lexicon.

It also includes collecting written Palestinian text from different sources, then analyzing and annotating the corpus using resources designed for IV Egyptian dialect, then manually annotating the list of words that cannot be analyzed by existing resources, and finally using the existing annotation tool to double-check the annotated corpus.
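
The split between words an existing resource can analyze and words left for manual annotation can be illustrated with a minimal sketch. It is not the thesis's actual tooling: the `lexicon` dictionary is a hypothetical stand-in for an existing analyzer, the function name `split_by_coverage` is invented for illustration, and the transliterated tokens and tags are made-up toy data.

```python
from collections import Counter


def split_by_coverage(tokens, lexicon):
    """Separate tokens the existing resource can analyze from those it cannot.

    `lexicon` is a hypothetical stand-in for an existing analyzer: a plain
    mapping from a word form to an annotation label.
    """
    annotated = []       # (token, analysis) pairs covered by the resource
    unknown = Counter()  # uncovered tokens, counted for manual annotation
    for tok in tokens:
        analysis = lexicon.get(tok)
        if analysis is None:
            unknown[tok] += 1
        else:
            annotated.append((tok, analysis))
    return annotated, unknown


# Toy data (assumed, transliterated for readability).
lexicon = {"beit": "NOUN", "kbir": "ADJ"}
tokens = "beit kbir heik beit heik".split()

annotated, unknown = split_by_coverage(tokens, lexicon)

# The most frequent uncovered words would be annotated by hand first.
manual_queue = [word for word, _ in unknown.most_common()]
```

Ordering the manual queue by frequency is one possible design choice: annotating the most common uncovered words first raises corpus coverage fastest for a given amount of manual work.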

Main Subjects

Information Technology and Computer Science

No. of Pages

106

Table of Contents

Table of contents.

Abstract.

Abstract in Arabic.

Chapter One : Introduction.

Chapter Two : Literature review.

Chapter Three : Corpus collection.

Chapter Four : Corpus annotation.

Chapter Five : Conclusion and future work.

References.

American Psychological Association (APA)

Akra, Diyam Fuad. (2015). Towards building a corpus for Palestinian dialect (Master's thesis). Birzeit University, Palestine (West Bank).
https://search.emarefa.net/detail/BIM-620217

Modern Language Association (MLA)

Akra, Diyam Fuad. Towards building a corpus for Palestinian dialect. Master's thesis, Birzeit University, 2015.
https://search.emarefa.net/detail/BIM-620217

American Medical Association (AMA)

Akra, Diyam Fuad. (2015). Towards building a corpus for Palestinian dialect (Master's thesis). Birzeit University, Palestine (West Bank).
https://search.emarefa.net/detail/BIM-620217

Language

English

Data Type

Arab Theses

Record ID

BIM-620217