Application of Filters to Multiway Joins in MapReduce

Joint Authors

Kim, Hyoung-Joo
Lee, Taewhi
Im, Dong-Hyuk
Kim, Hangkyu

Source

Mathematical Problems in Engineering

Issue

Vol. 2014, Issue 2014 (31 Dec. 2014), pp.1-11, 11 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2014-03-04

Country of Publication

Egypt

No. of Pages

11

Main Subjects

Civil Engineering

Abstract EN

Joining multiple datasets in MapReduce may amplify the disk and network overheads because intermediate join results have to be written to the underlying distributed file system, or map output records have to be replicated multiple times.

This paper proposes a method for applying filters based on the processing order of input datasets, which is appropriate for the two types of multiway joins: common attribute joins and distinct attribute joins.

The number of redundant records filtered depends on the processing order.

In common attribute joins, the input records do not need to be replicated, so a set of filters is created, which are applied in turn.

In distinct attribute joins, the input records have to be replicated, so multiple sets of filters need to be created, which depend on the number of join attributes.

The experimental results showed that our approach outperformed a cascade of two-way joins and basic multiway joins in cases where small portions of input datasets were joined.

American Psychological Association (APA)

Lee, Taewhi& Im, Dong-Hyuk& Kim, Hangkyu& Kim, Hyoung-Joo. 2014. Application of Filters to Multiway Joins in MapReduce. Mathematical Problems in Engineering،Vol. 2014, no. 2014, pp.1-11.
https://search.emarefa.net/detail/BIM-457294

Modern Language Association (MLA)

Lee, Taewhi…[et al.]. Application of Filters to Multiway Joins in MapReduce. Mathematical Problems in Engineering No. 2014 (2014), pp.1-11.
https://search.emarefa.net/detail/BIM-457294

American Medical Association (AMA)

Lee, Taewhi& Im, Dong-Hyuk& Kim, Hangkyu& Kim, Hyoung-Joo. Application of Filters to Multiway Joins in MapReduce. Mathematical Problems in Engineering. 2014. Vol. 2014, no. 2014, pp.1-11.
https://search.emarefa.net/detail/BIM-457294

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-457294