The impact of virtualization on high performance computing clustering in the cloud

Thesis author

Achahbar, Ouidad

Thesis advisor

Radwan, Muhammad

University

Al Akhawayn University

Faculty

Faculty of Science and Engineering

University country

Morocco

Degree

Master's

Degree date

2014

English abstract

The growing pervasiveness of Internet access is substantially increasing the production of big data.

This, in turn, increases the demand for compute power to process massive data, making High Performance Computing (HPC) a highly sought-after service.

Based on the paradigm of providing computing as a utility, the cloud offers user-friendly infrastructures for processing this big data, e.g., High Performance Computing as a Service (HPCaaS).

Still, HPCaaS performance is tightly coupled with the underlying virtualization technique, since the latter controls the creation of the virtual machine instances that carry out data processing jobs.

In this thesis, we characterize and evaluate the impact of machine virtualization on HPCaaS.

We track HPC performance under different cloud virtualization platforms, namely KVM and VMware ESXi, and compare it to the performance in a physical computing cluster infrastructure.

The virtualized environment is deployed using Hadoop on top of OpenStack.

The resulting HPCaaS runs MapReduce algorithms on benchmarked big data samples using a granularity of 8 physical machines per cluster.

We obtained several notable results when we ran the selected benchmarks on the virtualized and physical clusters.

Each tested cluster provided different performance trends.

Yet the overall analysis of the research findings showed that the choice of virtualization technology can lead to significant improvements when running and managing HPCaaS.

Main subjects

Information Technology and Computer Science

Number of pages

170

Table of contents

Table of contents.

Abstract.

Abstract in Arabic.

Chapter One : Introduction.

Chapter Two : Cloud computing.

Chapter Three : Virtualization.

Chapter Four : Big data and high performance computing as a service.

Chapter Five : Literature review and research contribution.

Chapter Six : Technology enablers selection.

Chapter Seven : OpenStack.

Chapter Eight : Hadoop.

Chapter Nine : Research methodology.

Chapter Ten : Experimental setup.

Chapter Eleven : Experimental results.

Chapter Twelve : Discussion.

Chapter Thirteen : Conclusion and future work.

References.

American Psychological Association (APA) citation style

Achahbar, Ouidad. (2014). The impact of virtualization on high performance computing clustering in the cloud. (Master's thesis). Al Akhawayn University, Morocco.
https://search.emarefa.net/detail/BIM-629581

Modern Language Association (MLA) citation style

Achahbar, Ouidad. The impact of virtualization on high performance computing clustering in the cloud. (Master's thesis). Al Akhawayn University. (2014).
https://search.emarefa.net/detail/BIM-629581

American Medical Association (AMA) citation style

Achahbar, Ouidad. (2014). The impact of virtualization on high performance computing clustering in the cloud. (Master's thesis). Al Akhawayn University, Morocco.
https://search.emarefa.net/detail/BIM-629581

Text language

English

Data type

Theses and dissertations

Record number

BIM-629581