Tags: Apache Spark

Q&A: Hortonworks CTO Unfolds the Big Data Road Map

Hortonworks' Scott Gnau talks about Apache Spark vs. Hadoop and data in motion. Hortonworks has built its business on big data and Hadoop, but the Hortonworks Data Platform provides analytics and features support for a range of technologies beyond Hadoop, including MapReduce, Pig, Hive, and Spark....
Read 0 Comments

crystal-ball.jpg

Common Search
It is critical for the Internet to have both commercial and non-commercial search engines available, so that we can compare their results and watch out for biases. says Common Search founder Sylvain Zimmer in this preview to his talk at Apache: Big Data Europe.

Ranking the Web With Radical Transparency

Ranking every URL on the web in a transparent and reproducible way is a core concept of the Common Search project, says Sylvain Zimmer, who will be speaking at the upcoming Apache: Big Data Europe conference in Seville, Spain. The web has become a critical resource for humanity, and search engines...
Read 0 Comments

todd-moore-apachecon-2.jpg

Todd Moore
It became apparent that open source could be the engine to go out and drive things, said Todd Moore in his keynote at ApacheCon.

IBM’s Wager on Open Source Is Still Paying Off

When IBM got involved with the Linux open source project in 1998, they were betting that giving their code and time to the community would be a worthwhile investment. Now, 18 years later, IBM is more involved than ever, with more than 62,000 employees trained and expected to contribute to open...
Read 0 Comments

Spark-Powered Splice Machine Goes Open Source

Splice Machine, the relational SQL database system that uses Hadoop and Spark to provide high-speed results, is nowavailable in an open source edition. Version 2.0 of Splice Machine added Spark to speed up OLAP-style workloads while still processing conventional OLTP workloads with HBase. The open...
Read 0 Comments

All the Apache Streaming Projects: An Exploratory Guide

The speed at which data is generated, consumed, processed, and analyzed is increasing at an unbelievably rapid pace. Social media, the Internet of Things, ad tech, and gaming verticals are struggling to deal with the disproportionate size of data sets. These industries demand data processing and...
Read 0 Comments

matei-zaharia.jpg

Matei Zaharia
Apache Spark creator Matei Zaharia speaking at MesosCon North America.

Apache Spark Creator Matei Zaharia Describes Structured Streaming in Spark 2.0 [Video]

Apache Spark has been an integral part of Mesos from its inception. Spark is one of the most widely used big data processing systems for clusters. Matei Zaharia, the CTO of Databricks and creator of Spark, talked about Spark's advanced data analysis power and new features in its upcoming 2.0...
Read 0 Comments

MapR Shows Off Enterprise-Grade Spark Distribution

At Spark Summit in San Francisco, Calif., this week, Hadoop distribution vendor MapR Technologies announced a new enterprise-grade Apache Spark distribution. The new distribution, available now in both MapR Converged Community Edition and MapR Converged Enterprise Edition, includes the complete...
Read 0 Comments

IBM Launches Cloud-Based Development Environment for Apache Spark

IBM recently announced the Data Science Experience on its Bluemix cloud platform to help developers build intelligent applications and make data analytics more accessible to the enterprise.
Read 0 Comments

hot-air-1373167_1920.png

Projects
These data analytics projects are on the rise: Apache Grappa, Apache Drill, and Apache Kafka.

3 Emerging Open Source Data Analytics Tools Beyond Apache Spark

On the data analytics front, profound change is in the air, and open source tools are leading many of the changes. Sure, you are probably familiar with some of the open source stars in this space, such as Hadoop and Apache Spark, but there is now a strong need for new tools that can holistically...
Read 0 Comments

Microsoft, MapR Announce New Apache Spark-Based Releases

Microsoft, with its Hortonworks-based cloud Hadoop distro, and MapR with its own Hadoop-powered wares, each pivot toward Apache Spark.
Read 0 Comments

Pages

Click Here!