Data Placement Algorithm for Improving IO Load Balance without Using Popularity Information

Joint Authors

Gui, Xiaolin
Luo, Xiangyu
Xin, Gang

Source

Mathematical Problems in Engineering

Issue

Vol. 2019, Issue 2019 (31 Dec. 2019), pp.1-10, 10 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2019-01-17

Country of Publication

Egypt

No. of Pages

10

Main Subjects

Civil Engineering

Abstract EN

Data placement considerably affects the I/O performance of distributed storage systems such as HDFS.

An ideal placement algorithm should keep the I/O load evenly distributed among different storage nodes.

Most existing placement algorithms that guarantee I/O load balance rely on data popularity information to make placement decisions.

However, the popularity information is typically not available in the data placement phase.

Furthermore, it usually varies during the data lifecycle.

In this paper, we propose a new placement algorithm called Balanced Distribution for Each Age Group (BEAG), which makes data placement decisions in the absence of the popularity information.

This algorithm maintains multiple counters for each storage node, with each counter representing the amount of data belonging to a certain age group.

It ensures that the data in each age group are equally scattered among the different storage nodes.
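
The counter scheme described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say how age groups are delimited or how ties are broken, so the fixed-length creation-time window (`window_seconds`), the class name `AgeGroupPlacer`, and the tie-break on total stored data are all assumptions made here for the example.

```python
from collections import defaultdict


class AgeGroupPlacer:
    """Hypothetical placer keeping one counter per (node, age group)."""

    def __init__(self, nodes, window_seconds):
        self.nodes = list(nodes)
        # Assumption: an age group is a fixed-length creation-time window.
        self.window_seconds = window_seconds
        # counters[node][group] = amount of data from that group on that node
        self.counters = {node: defaultdict(int) for node in self.nodes}

    def age_group(self, created_at):
        """Map a creation timestamp (seconds) to an age-group index."""
        return int(created_at // self.window_seconds)

    def place(self, size, created_at):
        """Choose the node holding the least data of the block's age group."""
        group = self.age_group(created_at)
        target = min(
            self.nodes,
            # Assumed tie-break: prefer the node with less data overall.
            key=lambda n: (self.counters[n][group],
                           sum(self.counters[n].values())),
        )
        self.counters[target][group] += size
        return target


placer = AgeGroupPlacer(["n1", "n2", "n3"], window_seconds=3600)
for t in range(6):                 # six blocks created in the first hour
    placer.place(size=64, created_at=t)
for t in range(3):                 # three blocks created in the second hour
    placer.place(size=64, created_at=4000 + t)
```

Because blocks created in the same window always have the same age, keying the counters by creation window keeps the groups stable as the data gets older. No popularity value appears anywhere in the placement decision.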

Because the popularity variance of data within the same age group is considerably smaller than that of the entire data set, BEAG significantly improves the I/O load balance.

Experimental results show that, compared with other popularity-independent algorithms, BEAG decreases the standard deviation of the I/O load by 11.6% to 30.4%.
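
A relative decrease in I/O load standard deviation of this kind could be computed as in the sketch below. The abstract does not define how I/O load is measured, so treating it as a per-node request count, using the population standard deviation, and the helper names are all assumptions; the load figures are made-up illustrative values, not results from the paper.

```python
from statistics import pstdev


def io_load_stddev(per_node_load):
    """Population standard deviation of the per-node I/O load."""
    return pstdev(per_node_load)


def relative_reduction(baseline_std, new_std):
    """Fraction by which new_std is smaller than baseline_std."""
    return (baseline_std - new_std) / baseline_std


# Made-up request counts for four storage nodes (illustration only).
baseline_load = [120, 80, 100, 100]
balanced_load = [110, 90, 100, 100]

reduction = relative_reduction(
    io_load_stddev(baseline_load),
    io_load_stddev(balanced_load),
)
print(f"standard deviation reduced by {reduction:.1%}")
```

A lower standard deviation means the per-node loads sit closer to their mean, which is the sense of "load balance" used in the abstract.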

American Psychological Association (APA)

Luo, Xiangyu & Xin, Gang & Gui, Xiaolin. 2019. Data Placement Algorithm for Improving IO Load Balance without Using Popularity Information. Mathematical Problems in Engineering, Vol. 2019, no. 2019, pp.1-10.
https://search.emarefa.net/detail/BIM-1194848

Modern Language Association (MLA)

Luo, Xiangyu…[et al.]. Data Placement Algorithm for Improving IO Load Balance without Using Popularity Information. Mathematical Problems in Engineering No. 2019 (2019), pp.1-10.
https://search.emarefa.net/detail/BIM-1194848

American Medical Association (AMA)

Luo, Xiangyu & Xin, Gang & Gui, Xiaolin. Data Placement Algorithm for Improving IO Load Balance without Using Popularity Information. Mathematical Problems in Engineering. 2019. Vol. 2019, no. 2019, pp.1-10.
https://search.emarefa.net/detail/BIM-1194848

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1194848