Improving IO Efficiency in Hadoop-Based Massive Data Analysis Programs

المؤلفون المشاركون

Lee, Kyong-Ha
Kang, Woo Lam
Suh, Young-Kyoon

المصدر

Scientific Programming

العدد

المجلد 2018، العدد 2018 (31 ديسمبر/كانون الأول 2018)، ص ص. 1-9، 9ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2018-12-02

دولة النشر

مصر

عدد الصفحات

9

التخصصات الرئيسية

الرياضيات

الملخص EN

Apache Hadoop has been a popular parallel processing tool in the era of big data.

While practitioners have rewritten many conventional analysis algorithms to make them customized to Hadoop, the issue of inefficient I/O in Hadoop-based programs has been repeatedly reported in the literature.

In this article, we address the problem of the I/O inefficiency in Hadoop-based massive data analysis by introducing our efficient modification of Hadoop.

We first incorporate a columnar data layout into the conventional Hadoop framework, without any modification of the Hadoop internals.

We also provide Hadoop with indexing capability to save a huge amount of I/O while processing not only selection predicates but also star-join queries that are often used in many analysis tasks.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Lee, Kyong-Ha& Kang, Woo Lam& Suh, Young-Kyoon. 2018. Improving IO Efficiency in Hadoop-Based Massive Data Analysis Programs. Scientific Programming،Vol. 2018, no. 2018, pp.1-9.
https://search.emarefa.net/detail/BIM-1214653

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Lee, Kyong-Ha…[et al.]. Improving IO Efficiency in Hadoop-Based Massive Data Analysis Programs. Scientific Programming No. 2018 (2018), pp.1-9.
https://search.emarefa.net/detail/BIM-1214653

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Lee, Kyong-Ha& Kang, Woo Lam& Suh, Young-Kyoon. Improving IO Efficiency in Hadoop-Based Massive Data Analysis Programs. Scientific Programming. 2018. Vol. 2018, no. 2018, pp.1-9.
https://search.emarefa.net/detail/BIM-1214653

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1214653