Showing posts with the label “Technology”

Big Data on OpenStack

OpenStack provides a management framework, in the form of a suite of tools, for handling management tasks across the various kinds of physical resources in data centers (such as computing, networking, and storage resources, and VM images). It aims not only to improve the utilization of physical resources but also to make resource management and provisioning easier and more convenient. It runs on top of many popular hypervisors, such as KVM and Xen, which provide the virtualization environment on a single host. Simply put, OpenStack is a centralized resource management tool for data centers.

MapReduce/Hadoop is a framework for efficiently processing huge volumes of unstructured data in data centers. In MapReduce, a request (job) is split into multiple tasks that are processed on distributed hosts, so that a shorter completion time can be achieved. Therefore, an effective way to process big data in data centers is to deploy MapReduc