Show simple item record

dc.contributor.advisorThomas, Johnson P.
dc.contributor.authorAina, Ademola Chukwudi
dc.date.accessioned2016-04-15T21:49:07Z
dc.date.available2016-04-15T21:49:07Z
dc.date.issued2014-12-01
dc.identifier.urihttps://hdl.handle.net/11244/33385
dc.description.abstractHadoop MapReduce is a parallel, distributed programming model for processing large data sets or so-called Big data, on a cluster. The basic idea of MapReduce is to split the large input data set into many small pieces and assign these pieces to different devices for processing [5]. In this thesis, we took a look at performance evaluation of the MapReduce framework. MapReduce can be improved to perform speculative execution with maximum performance. Thus, optimizing the cost of computation and cost of communication will help achieve better performance. These optimizations are done by measuring the processing power of each machine and distributing task based on the capacity of each machine. The second step, measure he communication overheads and distribute tasks in the system for a given job or workload. To this end, we represent the Hadoop MapReduce execution with a functional model, and develop an optimization model for performance improvement in the system. Our experiments show that the proposed developed optimization functional model outperforms the regular functional model of the Hadoop MapReduce system by a factor of 2.
dc.formatapplication/pdf
dc.languageen_US
dc.publisherOklahoma State University
dc.rightsCopyright is held by the author who has granted the Oklahoma State University Library the non-exclusive right to share this material in its institutional repository. Contact Digital Library Services at lib-dls@okstate.edu or 405-744-9161 for the permission policy on the use, reproduction or distribution of this material.
dc.titleImproving Performance in Hadoop Mapreduce
dc.typetext
dc.contributor.committeeMemberGeorge, K. M.
dc.contributor.committeeMemberCline, David
osu.filenameAINA_okstate_0664M_13806.pdf
osu.accesstypeOpen Access
dc.description.departmentComputer Science
dc.type.genreThesis


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record