Improving extractive multi-document text summarization through multi-objective optimization

Other Title(s)

تحسين التخليص الاقتطاعي للمستندات النصية المتعددة من خلال أمثلية تعدد الأهداف

Joint Authors

Salih, Hilal Hadi
Kazim, Nisrin Jawad
Atiyyah, Bara Ali

Source

Iraqi Journal of Science

Issue

Vol. 59, Issue 4B (31 Dec. 2018), pp.2135-2149, 15 p.

Publisher

University of Baghdad College of Science

Publication Date

2018-12-31

Country of Publication

Iraq

No. of Pages

15

Main Subjects

Information Technology and Computer Science

Abstract EN

Multi-document summarization is an optimization problem demanding optimization of more than one objective function simultaneously.

The proposed work regards balancing of the two significant objectives: content coverage and diversity when generating summaries from a collection of text documents.

Any automatic text summarization system has the challenge of producing high quality summary.

Despite the existing efforts on designing and evaluating the performance of many text summarization techniques, their formulations lack the introduction of any model that can give an explicit representation of – coverage and diversity – the two contradictory semantics of any summary.

In this work, the design of generic text summarization model based on sentence extraction is redirected into more semantic measure reflecting individually both content coverage and content diversity as two explicit optimization models.

The problem is defined by projecting the first criterion, i.e.

content coverage in the light of text similarity.

The proposed model hypothesizes a possible decomposition of text similarity into three different levels of optimization formula.

First, aspire to global optimization, the candidate summary should cover the summary of the document collection.

Then, to attain, less global optimization, the sentences of the candidate summary should cover the summary of the document collection.

The third level of optimization is content with local optimization, where the difference between the magnitude of terms covered by the candidate summary and those of the document collection should be small.

This coverage model is coupled with a proposed diversity model and defined as a Multi-Objective Optimization (MOO) problem.

Moreover, heuristic perturbation and heuristic local repair operators have been proposed and injected into the adopted evolutionary algorithm to harness its strength.

Assessment of the proposed model has been performed using document sets supplied by Document Understanding Conference 2002 ( ) and a comparison has been made with other state-of-the-art methods.

Metric used to measure performance of the proposed work is Recall-Oriented Understudy for Gisting Evaluation ( ) toolkit.

Results obtained support strong proof for the effectiveness and the significant performance awarded to the proposed MOO model over other state-of-the-art models.

American Psychological Association (APA)

Kazim, Nisrin Jawad& Salih, Hilal Hadi& Atiyyah, Bara Ali. 2018. Improving extractive multi-document text summarization through multi-objective optimization. Iraqi Journal of Science،Vol. 59, no. 4B, pp.2135-2149.
https://search.emarefa.net/detail/BIM-897016

Modern Language Association (MLA)

Kazim, Nisrin Jawad…[et al.]. Improving extractive multi-document text summarization through multi-objective optimization. Iraqi Journal of Science Vol. 59, no. 4B (2018), pp.2135-2149.
https://search.emarefa.net/detail/BIM-897016

American Medical Association (AMA)

Kazim, Nisrin Jawad& Salih, Hilal Hadi& Atiyyah, Bara Ali. Improving extractive multi-document text summarization through multi-objective optimization. Iraqi Journal of Science. 2018. Vol. 59, no. 4B, pp.2135-2149.
https://search.emarefa.net/detail/BIM-897016

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references : p. 2148-2149

Record ID

BIM-897016