SIDR: Structure-Aware Intelligent Data Routing in Hadoop

The MapReduce framework is being extended for domains quite different from the web applications for which it was designed, including the processing of big structured data, e.g., scientific and financial data. Previous work using MapReduce to process …

SciHadoop Semantic Compression

Structure-Aware Intelligent Data Routing in SciHadoop

SciHadoop: Array-based Query Processing in Hadoop

Hadoop has become the de facto platform for large-scale data analysis in commercial applications, and increasingly so in scientific applications. However, Hadoop's byte stream data model causes inefficiencies when used to process scientific data that …