تطوير خوارزمية لحذف الصور شبه المكررة في Hadoop

Other Title(s)

Develop an algorithm to delete near-duplicate images in Hadoop

Joint Authors

زقزوق، عمار علي
حسن علي حسن

Source

مجلة العلوم الهندسية و تكنولوجيا المعلومات

Issue

Vol. 5, Issue 1 (31 Mar. 2021), pp.1-11, 11 p.

Publisher

National Research Center

Publication Date

2021-03-31

Country of Publication

Palestine (Gaza Strip)

No. of Pages

11

Main Subjects

Engineering & Technology Sciences (Multidisciplinary)

Abstract EN

The concept of near-duplicate images refers to images that are subjected to noise, that have been compressed, or whose resolution is reduced as a result of their transmission, and other images to which digital image operations are applied.

The ideal storage system aims to optimize the storage space, by managing, structuring and organizing data in an efficient manner, so that the storage space is preserved including valuable and useful information, and we get rid of useless data.

The space occupied by insignificant data is called wasted space, and this space increases with the increase of these files, resulting in a waste of storage space, which makes it difficult to manage storage space and organize data, which affects the overall system performance.

Hadoop is used to store and process big data, and depends on branching in storing data, as the data is divided into parts (blocks), and these parts are distributed in computer devices, called these devices (Data Nodes).

Researchers have developed techniques to get rid of fragments of duplicate data, in order to save storage space in the Hadoop system, but each node may contain unimportant files occupying part of this space, so we will present in this research a technique to delete the near-duplicate images stored within data nodes, using a Discrete Cosine Transform (DCT).

American Psychological Association (APA)

حسن علي حسن وزقزوق، عمار علي. 2021. تطوير خوارزمية لحذف الصور شبه المكررة في Hadoop. مجلة العلوم الهندسية و تكنولوجيا المعلومات،مج. 5، ع. 1، ص ص. 1-11.
https://search.emarefa.net/detail/BIM-1307880

Modern Language Association (MLA)

حسن علي حسن وزقزوق، عمار علي. تطوير خوارزمية لحذف الصور شبه المكررة في Hadoop. مجلة العلوم الهندسية و تكنولوجيا المعلومات مج. 5، ع. 1 (آذار 2021)، ص ص. 1-11.
https://search.emarefa.net/detail/BIM-1307880

American Medical Association (AMA)

حسن علي حسن وزقزوق، عمار علي. تطوير خوارزمية لحذف الصور شبه المكررة في Hadoop. مجلة العلوم الهندسية و تكنولوجيا المعلومات. 2021. مج. 5، ع. 1، ص ص. 1-11.
https://search.emarefa.net/detail/BIM-1307880

Data Type

Journal Articles

Language

Arabic

Notes

يتضمن مراجع ببليوجرافية : ص. 10-11

Record ID

BIM-1307880