NexComm 2014
February 23 - 27, 2014
Nice, France

DigitalWorld 2014
March 23 - 27, 2014
Barcelona, Spain

InfoSys 2014
April 20 - 24, 2014
Chamonix, France

BioSciencesWorld 2014
April 20 - 24, 2014
Chamonix, France

ComputationWorld 2014
May 25 - 29, 2014
Venice, Italy

InfoWare 2014
June 22 - 26, 2014
Seville, Spain

DataSys 2014
July 20 - 24, 2014
Paris, France

NexTech 2014
August 24 - 28, 2014
Rome, Italy

SoftNet 2014
October 12 - 16, 2014
Nice, France

NetWare 2014
November 16 - 20, 2014
Lisbon, Portugal

 

ThinkMind // CLOUD COMPUTING 2010, The First International Conference on Cloud Computing, GRIDs, and Virtualization // View article cloud_computing_2010_3_20_50031


The Limitation of MapReduce: A Probing Case and a Lightweight Solution

Authors:
Zhiqiang Ma
Lin Gu

Keywords: Distributed computing; Parallel architectures

Abstract:
MapReduce is arguably the most successful parallelization framework especially for processing large data sets in datacenters comprising commodity computers. However, difficulties are observed in porting sophisticated applications to MapReduce, albeit the existence of numerous parallelization opportunities. Intrinsically, the MapReduce design allows a program to scale up to handle extremely large data sets, but constrains a program's ability to process smaller data items and exploit variable-degrees of parallelization opportunities which are likely to be the common case in general application. In this paper, we analyze the limitations of MapReduce and present the design and implementation of a new lightweight parallelization framework, MRlite. MRlite can efficiently process moderatesize data with dependences among numerous computational steps. In the mean time, the parallelization on each step emulates the MapReduce model. Hence, the MRlite framework can also scale up for large data sets if massive parallelism with minimal dependence exists. MRlite can significantly improve the flexibility and parallel execution performance for a number of typical programs. Our evaluation shows that MRlite is one order of magnitude faster than Hadoop on problems that MapReduce has difficulty in handling.

Pages: 68 to 73

Copyright: Copyright (c) IARIA, 2010

Publication date: November 21, 2010

Published in: conference

ISSN: 2308-4294

ISBN: 978-1-61208-106-9

Location: Lisbon, Portugal

Dates: from November 21, 2010 to November 26, 2010

SERVICES CONTACT
2010 - 2014 © ThinkMind. All rights reserved.
Read Terms of Service and Privacy Policy.