A Data Aware Caching (Dache) for Big-Data Applications Using the MapReduce Framework

Utkarsh Honey, Yogesh More, Prasad Wandhekar, Santosh Wayal, Prof. Jayashree Chaudhari

Abstract: The big-data refers to the large-scale distributed data processing applications which works on exceptionally large amounts of data. Google’s MapReduce and Apache’s Hadoop, its open-source implementation, are the software systems for big-data applications. An observation of the MapReduce framework is that the framework generates a large amount of intermediate data. MapReduce is unable to utilize such data so they are thrown after used. We propose Dache, a data-aware cache framework used for big-data applications. In Dache, tasks submit their intermediate results to the cache manager and queries the cache manager before executing the actual computing work. A novel cache description system and a cache request and reply protocol are designed.

Keyword: Big-data, MapReduce, Hadoop, caching.

Title: A Data Aware Caching (Dache) for Big-Data Applications Using the MapReduce Framework

Author: Utkarsh Honey, Yogesh More, Prasad Wandhekar, Santosh Wayal, Prof. Jayashree Chaudhari

International Journal of Computer Science and Information Technology Research

ISSN 2348-1196 (print), SSN 2348-120X (online)

Research Publish Journals

Vol. 3, Issue 4, October 2015 – December 2015

Citation
Share : Facebook Twitter Linked In

Citation
A Data Aware Caching (Dache) for Big-Data Applications Using the MapReduce Framework by Utkarsh Honey, Yogesh More, Prasad Wandhekar, Santosh Wayal, Prof. Jayashree Chaudhari