Big data Big data feed

Learn how open source tools are powering the big data and analytics revolution.

Net catching 1s and 0s or data in the clouds

Route real-time events from web, mobile, and server-side app sources to help build your customer data lake on your data warehouse.
metrics and data shown on a computer screen

Since its creation in 2015 at an Airbnb hackathon, Apache Superset has matured into a leading open source BI solution.
Person standing in front of a giant computer screen with numbers, data

As an open source alternative to Segment, RudderStack collects and routes event stream (or clickstream) data and automatically builds your customer data lake on your data warehouse.
Looking at a map

Learn how open source data science languages, libraries, and tools are helping us understand our world better by reviewing 2020's top 10 data science articles on Opensource.com.
Alarm clocks with different time

Open source is leading the way with a rich canvas of projects for processing real-time events.
Houses in a row

NGT is a high-performing, open source deep learning library for large-scale and high-dimensional vectors.
computer screen

Let's look closely at the Apache Hive and Apache HBase to understand which one can cope better with query performance.
Person standing in front of a giant computer screen with numbers, data

Case study with NASA logs to show how Spark can be leveraged for analyzing data at scale.