Distributed Data Management Using MapReduce
FENG LI , National University of Singapore
BENG CHIN OOI , National University of Singapore
M. TAMER ZSU, University of Waterloo
SAI WU, Zhejiang University
MapReduce is a framework for processing and managing large scale data sets in a distributed cluster, which has been used for applications such as generating search indexes, document clustering, access log analysis,and various other forms of data analytics.