Systematic Mapping Study on Performance Scalability in Big Data on Cloud Using VM and Container

In recent years, big data and cloud computing have gained importance in IT and business. These two technologies are becoming complementing in a way that the former requires large amount of storage and computation power, which are the key enabler technolog

PDF / 590,488 Bytes
8 Pages / 439.37 x 666.142 pts Page_size
110 Downloads / 220 Views

DOWNLOAD

REPORT

Institute of Natural and Applied Sciences, Atilim University, Ankara, Turkey [email protected] 2 Faculty of Engineering, Atilim University, Ankara, Turkey {ziya.karakaya,ali.yazici}@atilim.edu.tr

Abstract. In recent years, big data and cloud computing have gained importance in IT and business. These two technologies are becoming complementing in a way that the former requires large amount of storage and computation power, which are the key enabler technologies of Big Data; the latter, cloud computing, brings the opportunity to scale ondemand computation power and provides massive quantities of storage space. Until recently, the only technique used in computation resource utilization was based on the hypervisor, which is used to create the virtual machine. Nowadays, another technique, which claims better resource utilization, called “container” is becoming popular. This technique is otherwise known as “lightweight virtualization” since it creates completely isolated virtual environments on top of underlying operating systems. The main objective of this study is to clarify the research area concerned with performance issues using VM and container in big data on cloud, and to give a direction for future research.

1

Introduction

Big data applications continue to receive an ever-increasing amount of attention, thus they become a dominant class of applications deployed over virtualized environments [1]. On the other hand, the resource utilization feature of cloud computing is mostly based on virtualization techniques, which is the common way to run diﬀerent services on the cloud [2]. By combining these two, most of the big data on cloud environments are using hypervisor to provision the virtual machines. In this technique, the VMs have their own operating systems which run on the virtual hardware resources provided by hypervisor [3]. Although it is proven to be a very useful technique in resource utilization, still there is an inherent overhead because of the hypervisor [1]. In recent years, containers, which are also called “lightweight virtualization”, are gaining popularity due to their ability to oﬀer superior performance because they do not have their own operating systems [2]. Instead, they use the OS kernel underlined with the host machine and they work similar to a regular c IFIP International Federation for Information Processing 2016 Published by Springer International Publishing Switzerland 2016. All Rights Reserved L. Iliadis and I. Maglogiannis (Eds.): AIAI 2016, IFIP AICT 475, pp. 634–641, 2016. DOI: 10.1007/978-3-319-44944-9 56

Systematic Mapping Study on Performance Scalability

635

application and are completely isolated from each other as well as from the underlying system. This technique receives its popularity mostly in Linux OS virtualization, since it uses the features provided by Linux OS kernel itself, such as “cgroup”, “namespace”, etc., in order to completely isolate each container from the rest. In this study, along with the other research questions, the main purpose of investigat

Data Loading...

Systematic Mapping Study on Performance Scalability in Big Data on Cloud Using VM and Container

Recommend Documents

Predicting the performance of big data applications on the cloud

A Study on Blockchain Scalability

Big Data and Cloud Computing

Advances in Big Data and Cloud Computing

An Analytical Study on Importance of SLA for VM Migration Algorithm and Start-Ups in Cloud

Cloud Allocation and Consolidation Based on a Scalability Metric

Big Data Service of Financial Law Based on Cloud Computing

Cloud Networking for Big Data

Study on the Network Information Security Problems Under the Environment of Big Data Cloud Computing

PLMSys: A Cloud Monitoring System Based on Cluster Performance and Container Logs

Converging HPC, Big Data and Cloud Technologies for Precision Agriculture Data Analytics on Supercomputers

Systematic mapping study on domain-specific language development tools