Abstract: The big-data refers to the large-scale distributed data processing applications which works on exceptionally large amounts of data. Google’s MapReduce and Apache’s Hadoop, its open-source implementation, are the software systems for big-data applications. An observation of the MapReduce framework is that the framework generates a large amount of intermediate data. MapReduce is unable to utilize such data so they are thrown after used. We propose Dache, a data-aware cache framework used for big-data applications. In Dache, tasks submit their intermediate results to the cache manager and queries the cache manager before executing the actual computing work. A novel cache description system and a cache request and reply protocol are designed.
Keyword: Big-data, MapReduce, Hadoop, caching.
Title: A Data Aware Caching (Dache) for Big-Data Applications Using the MapReduce Framework
Author: Utkarsh Honey, Yogesh More, Prasad Wandhekar, Santosh Wayal, Prof. Jayashree Chaudhari
International Journal of Computer Science and Information Technology Research
ISSN 2348-1196 (print), SSN 2348-120X (online)
Research Publish Journals