Apache Spark Apache Spark feed

Apache Spark is an open source cluster computing framework that is frequently used in big data processing. 

An introduction to data processing with Cassandra and Spark

Dani and Jon will give a three hour tutorial at OSCON this year called: Becoming friends with...

Apache Spark is an open source cluster computing framework. In contrast to Hadoop’s two-stage disk-...

ApacheCon is coming up, and within that massive conference there will be a glimmering gem: a forum...
shapes of women and men illustration

Spark's new DataFrame API is inspired by data frames in R and Python (Pandas), but designed from...

How Databricks set a new world record for sorting 100 terabytes (TB) of data, or 1 trillion 100-...