Big data and open source go hand in hand

An introduction to big data from

Server room
Image by : 

Cory Doctorow. Modified by CC BY-SA 2.0.


Subscribe now

Get the highlights in your inbox every week.

Big data. It has certainly been a buzzword in recent years, but what is it really, and how are organizations leveraging open source tools to turn raw data into actionable insights?

At, a core piece of our mission is to keep you informed about trends and technologies where open source is making a difference. To help with that, we've created a new resource page which brings you up to speed with big data and some of the open source tools which businesses, governments, and organizations of all types are leveraging to make sense of huge quantities of bits and bytes.

If you've been wondering what big data is, how you can make use of it, and how it's changing the way we look at the world by bringing us information never before possible, we're here to help. In addition to bringing some sense to big data, we also look at:

  • How is open source making big data discoveries possible?
  • What is the MapReduce algorithm, and how does is make distributed computing possible?
  • What is Apache Hadoop, and how has it become the mainstay of many data scientists' processing needs?
  • What is Apache Spark, the new kid on the block, and how does it fit into the big picture of data processing?

We hope you'll check it out. If you find our resource helpful, please feel free to share it with your friends, family, and colleagues. And if you've got a big data question, let us know so we can continue to improve and build out this resource.

About the author - publishes stories about creating, adopting, and sharing open source solutions. Follow us on Twitter @opensourceway.