How to wrangle log data with Python and Apache Spark Case study with NASA logs to show how Spark can be leveraged for analyzing data at scale.
How to use Spark SQL: A hands-on tutorial This tutorial explains how to leverage relational databases at scale using Spark SQL and DataFrames.
9 resources for data science projects The most novice engineer can start on a path towards data science mastery in this new age where data science skills will be needed at every level.
Why data scientists love Kubernetes Kubernetes' features that streamline the software development workflow also support the data science workflow.
Erase unconscious bias from your AI datasets Biased training datasets can produce serious consequences in people's lives, explains All Things Open Lightning Talk speaker.
9 obscure Python libraries for data science Go beyond pandas, scikit-learn, and matplotlib and learn some new tricks for doing data science in Python.