apache

Spark's new DataFrame API is inspired by data frames in R and Python (Pandas), but designed from the ground up to support modern big data and data science applications.
1 comment Posted 26 Mar 2015 by Reynold Xin Feed
How does OpenStack differ from other large, popular open source projects and how do these differences affect the way the project is growing and maturing?
1 comment Posted 24 Mar 2015 by Stephen R. Walli Feed
Initially, Hadoop implementation required skilled teams of engineers and data scientists, making Hadoop too costly and cumbersome for many organizations. Now, thanks to a number of open source projects, big data analytics with Hadoop has become much more affordable and mainstream. Here's a look at... Read more
2 comments Posted 4 Mar 2015 by Jonathan Buckley Feed
Five stars for Top 5 articles of the week
The Opensource.com Weekly Top 5: the best and brightest burning star articles from this week: January 26 - 30
0 comments Posted 30 Jan 2015 by Jen Wike Huger (Red Hat) Feed
How Databricks set a new world record for sorting 100 terabytes (TB) of data, or 1 trillion 100-bytes, in 23 minutes with open source software Apache Spark and public cloud infrastructure EC2.
4 comments Posted 15 Jan 2015 by Reynold Xin Feed
Top 10 open source projects of 2014 with lightbulb
Annual list of top 10 open source projects covered on Opensource.com in 2014. From cloud computing to containers to project management, this year's showing in open source has been phenomenal.
2 comments Posted 16 Dec 2014 by Jen Wike Huger (Red Hat) Feed
From the Apache Quill series: A lightning talk recap about how the Apache Foundation has always done things a certain way at ApacheCon Budapest by Rich Bowen.
3 comments Posted 2 Dec 2014 by Rich Bowen (Red Hat) Feed
2014 has been a record year for Opensource.com. And, we couldn't have done it without you: our readers, writers, community moderators, editors, sharers, and supporters. Thank you. Thanks for making Opensource.com a vibrant community full of amazing stories. We, the open source community, have a... Read more
0 comments Posted 27 Nov 2014 by Jason Hibbets (Red Hat) Feed
Introduction to Apache Hadoop, an open source software framework for storage and large scale processing of data-sets on clusters of commodity hardware.
6 comments Posted 26 Aug 2014 by Sachin P Bappalige Feed
Twitter birds
Twitter has shifted its way of thinking about how to launch a new service thanks to the Apache Mesos project, an open source technology that brings together multiple servers into a shared pool of resources. It's an operating system for the data center. "When is the last time you've seen the fail... Read more
0 comments Posted 7 Aug 2014 by Jason Hibbets (Red Hat) Feed

Pages