Cloudera and Others Rally Behind Hadoop Challenger Spark


Folks in the Big Data and Hadoop communities are becoming increasingly interested in Apache Spark, an open source cluster computing framework for data analytics that was originally developed in the AMPLab at UC Berkeley. We’ve covered Spark before, and some reports characterize it as a tool that could supplant Hadoop in many enterprises.

According to Apache, Spark can run programs up to 100 times faster than Hadoop MapReduce in memory, and 10 times faster on disk. When crunching large data sets, those are big performance differences.

Read more at Ostatic