Reynold Xin is a Project Management Committee (PMC) member of Apache Spark and a co-founder of Databricks, a company started by the creators of Spark. He recently led an effort at Databricks to scale up Spark and set a new world record in 100 TB sorting (Daytona Gray). Prior to Databricks, he was pursuing a PhD in databases at the UC Berkeley AMPLab. He won two Best Demo Awards, at VLDB 2011 and SIGMOD 2012, and wrote the most cited papers of SIGMOD 2011 and SIGMOD 2013.

Authored Content

Spark's new DataFrame API is inspired by data frames in R and Python (Pandas), but designed from...

How Databricks set a new world record for sorting 100 terabytes (TB) of data, or 1 trillion 100-...