Lossless text compression technique using syllable based morphology
Joint Authors
Akman, Ibrahim
Bayindir, Hakan
Ozleme, Serkan
Akin, Zehra
Misra, Sanjay
Source
The International Arab Journal of Information Technology
Issue
Vol. 8, Issue 1 (31 Jan. 2011), pp.66-74, 9 p.
Publisher
Publication Date
2011-01-31
Country of Publication
Jordan
No. of Pages
9
Main Subjects
Information Technology and Computer Science
Topics
Abstract EN
In this paper, we present a new lossless text compression technique which utilizes syllable-based morphology of multi-syllabic languages.
The proposed algorithm is designed to partition words into its syllables and then to produce their shorter bit representations for compression.
The method has six main components namely source file, filtering unit, syllable unit, compression unit, dictionary file and target file.
The number of bits in coding syllables depends on the number of entries in the dictionary file.
The proposed algorithm is implemented and tested using 20 different texts of different lengths collected from different fields.
The results indicated a compression of up to 43%.
American Psychological Association (APA)
Akman, Ibrahim& Bayindir, Hakan& Ozleme, Serkan& Akin, Zehra& Misra, Sanjay. 2011. Lossless text compression technique using syllable based morphology. The International Arab Journal of Information Technology،Vol. 8, no. 1, pp.66-74.
https://search.emarefa.net/detail/BIM-244524
Modern Language Association (MLA)
Akman, Ibrahim…[et al.]. Lossless text compression technique using syllable based morphology. The International Arab Journal of Information Technology Vol. 8, no. 1 (Jan. 2011), pp.66-74.
https://search.emarefa.net/detail/BIM-244524
American Medical Association (AMA)
Akman, Ibrahim& Bayindir, Hakan& Ozleme, Serkan& Akin, Zehra& Misra, Sanjay. Lossless text compression technique using syllable based morphology. The International Arab Journal of Information Technology. 2011. Vol. 8, no. 1, pp.66-74.
https://search.emarefa.net/detail/BIM-244524
Data Type
Journal Articles
Language
English
Notes
Includes bibliographical references : p. 73-74
Record ID
BIM-244524