Efficient parallel compression and decompression for large XML files

Joint Authors

Khan, Minhaj
Ali, Muhammad

Source

The International Arab Journal of Information Technology

Issue

Vol. 13, Issue 4 (31 Jul. 2016)6 p.

Publisher

Zarqa University

Publication Date

2016-07-31

Country of Publication

Jordan

No. of Pages

6

Main Subjects

Information Technology and Computer Science

Topics

Abstract EN

eXtensible Markup Language (XML) is gaining popularity and is being used widely on internet for storing and exchanging data.

Large XML files when transferred on network create bottleneck and also degrade the query performance.

Therefore, efficient mechanisms of compression and decompression are applied to XML files.

In this paper, an algorithm for performing XML compression and decompression is suggested.

The suggested approach reads an XML file, removes tags, divides the XML file into different parts and then compresses each different part on a separate core for achieving efficiency.

We compare performance results of the proposed algorithm with parallel compression and decompression of XML files using GZIP.

The performance results show that the suggested algorithm performs 24%, 53% and 72 % better than the parallel GZIP compression and decompression on Intel Xeon, Intel core i7 and Intel core i3 based architectures respectively.

American Psychological Association (APA)

Ali, Muhammad& Khan, Minhaj. 2016. Efficient parallel compression and decompression for large XML files. The International Arab Journal of Information Technology،Vol. 13, no. 4.
https://search.emarefa.net/detail/BIM-655050

Modern Language Association (MLA)

Ali, Muhammad& Khan, Minhaj. Efficient parallel compression and decompression for large XML files. The International Arab Journal of Information Technology Vol. 13, no. 4 (Jul. 2016).
https://search.emarefa.net/detail/BIM-655050

American Medical Association (AMA)

Ali, Muhammad& Khan, Minhaj. Efficient parallel compression and decompression for large XML files. The International Arab Journal of Information Technology. 2016. Vol. 13, no. 4.
https://search.emarefa.net/detail/BIM-655050

Data Type

Journal Articles

Language

English

Notes

Includes appendices.

Record ID

BIM-655050