Text summarization technique for Punjabi language using neural networks

المؤلفون المشاركون

Kaur, Amanpreet
Yadav, Divakar
Jain, Arti
Arora, Anuja
Morato, Jorge

المصدر

The International Arab Journal of Information Technology

العدد

المجلد 18، العدد 6 (30 نوفمبر/تشرين الثاني 2021)، ص ص. 807-818، 12ص.

الناشر

جامعة الزرقاء عمادة البحث العلمي

تاريخ النشر

2021-11-30

دولة النشر

الأردن

عدد الصفحات

12

التخصصات الرئيسية

تكنولوجيا المعلومات وعلم الحاسوب

الملخص EN

In the contemporary world, utilization of digital content has risen exponentially.

For example, newspaper and web articles, status updates, advertisements etc.

have become an integral part of our daily routine.

Thus, there is a need to build an automated system to summarize such large documents of text in order to save time and effort.

Although, there are summarizers for languages such as English since the work has started in the 1950s and at present has led it up to a matured stage but there are several languages that still need special attention such as Punjabi language.

The Punjabi language is highly rich in morphological structure as compared to English and other foreign languages.

In this work, we provide three phase extractive summarization methodology using neural networks.

It induces compendious summary of Punjabi single text document.

The methodology incorporates pre-processing phase that cleans the text; processing phase that extracts statistical and linguistic features; and classification phase.

The classification based neural network applies an activation function- sigmoid and weighted error reduction-gradient descent optimization to generate the resultant output summary.

The proposed summarization system is applied over monolingual Punjabi text corpus from Indian languages corpora initiative phase-II.

The precision, recall and F-measure are achieved as 90.0%, 89.28% an 89.65% respectively which is reasonably good in comparison to the performance of other existing Indian languages’ summarizers.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Jain, Arti& Arora, Anuja& Yadav, Divakar& Morato, Jorge& Kaur, Amanpreet. 2021. Text summarization technique for Punjabi language using neural networks. The International Arab Journal of Information Technology،Vol. 18, no. 6, pp.807-818.
https://search.emarefa.net/detail/BIM-1430944

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Jain, Arti…[et al.]. Text summarization technique for Punjabi language using neural networks. The International Arab Journal of Information Technology Vol. 18, no. 6 (Nov. 2021), pp.807-818.
https://search.emarefa.net/detail/BIM-1430944

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Jain, Arti& Arora, Anuja& Yadav, Divakar& Morato, Jorge& Kaur, Amanpreet. Text summarization technique for Punjabi language using neural networks. The International Arab Journal of Information Technology. 2021. Vol. 18, no. 6, pp.807-818.
https://search.emarefa.net/detail/BIM-1430944

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references : p. 815-817

رقم السجل

BIM-1430944