September 5, 2008
OpenSolaris Project: Hadoop Live CD
Organizations routinely collect a huge amount of data, including web crawls, email messages, and scientific data. Processing these datasets with traditional relational database models or streaming algorithms is no longer scalable. A new data processing model, MapReduce, addresses this challenge by leveraging large clusters of hundreds or thousands of heterogeneous servers.