Text summarization technique for Punjabi language using neural networks

Joint Authors

Kaur, Amanpreet
Yadav, Divakar
Jain, Arti
Arora, Anuja
Morato, Jorge

Source

The International Arab Journal of Information Technology

Issue

Vol. 18, Issue 6 (30 Nov. 2021), pp.807-818, 12 p.

Publisher

Zarqa University Deanship of Scientific Research

Publication Date

2021-11-30

Country of Publication

Jordan

No. of Pages

12

Main Subjects

Information Technology and Computer Science

Abstract EN

In the contemporary world, utilization of digital content has risen exponentially.

For example, newspaper and web articles, status updates, advertisements etc.

have become an integral part of our daily routine.

Thus, there is a need to build an automated system to summarize such large documents of text in order to save time and effort.

Although, there are summarizers for languages such as English since the work has started in the 1950s and at present has led it up to a matured stage but there are several languages that still need special attention such as Punjabi language.

The Punjabi language is highly rich in morphological structure as compared to English and other foreign languages.

In this work, we provide three phase extractive summarization methodology using neural networks.

It induces compendious summary of Punjabi single text document.

The methodology incorporates pre-processing phase that cleans the text; processing phase that extracts statistical and linguistic features; and classification phase.

The classification based neural network applies an activation function- sigmoid and weighted error reduction-gradient descent optimization to generate the resultant output summary.

The proposed summarization system is applied over monolingual Punjabi text corpus from Indian languages corpora initiative phase-II.

The precision, recall and F-measure are achieved as 90.0%, 89.28% an 89.65% respectively which is reasonably good in comparison to the performance of other existing Indian languages’ summarizers.

American Psychological Association (APA)

Jain, Arti& Arora, Anuja& Yadav, Divakar& Morato, Jorge& Kaur, Amanpreet. 2021. Text summarization technique for Punjabi language using neural networks. The International Arab Journal of Information Technology،Vol. 18, no. 6, pp.807-818.
https://search.emarefa.net/detail/BIM-1430944

Modern Language Association (MLA)

Jain, Arti…[et al.]. Text summarization technique for Punjabi language using neural networks. The International Arab Journal of Information Technology Vol. 18, no. 6 (Nov. 2021), pp.807-818.
https://search.emarefa.net/detail/BIM-1430944

American Medical Association (AMA)

Jain, Arti& Arora, Anuja& Yadav, Divakar& Morato, Jorge& Kaur, Amanpreet. Text summarization technique for Punjabi language using neural networks. The International Arab Journal of Information Technology. 2021. Vol. 18, no. 6, pp.807-818.
https://search.emarefa.net/detail/BIM-1430944

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references : p. 815-817

Record ID

BIM-1430944