ApacheCon is coming up, and within that massive conference there will be a glimmering gem: a forum dedicated to Spark. The Spark Forum will have speakers from the Hive project, the Pig project, and the Sqoop project. There will also be two talks about Spark Streaming: one will be introductory, and the other...
Initially, Hadoop implementation required skilled teams of engineers and data scientists, making Hadoop too costly and cumbersome for many organizations. Now, thanks to a number of open source projects, big data analytics with Hadoop has become much more affordable and mainstream. Here's a look at...
Annual list of top 10 open source projects covered on Opensource.com in 2014. From cloud computing to containers to project management, this year's showing in open source has been phenomenal.
What's really accelerating today's pace of change in software is the combination of many open source parts building on and amplifying each other. It's a dynamic that just isn't possible with proprietary software.
Introduction to Apache Hadoop, an open source software framework for storage and large-scale processing of data sets on clusters of commodity hardware.
Milind Bhandarkar gives a talk about open source and big data at Great Wide Open 2014.
Apache Hadoop and related Apache Software Foundation projects have a rapidly growing feature set, high commit rates, and code contributions coming from across the globe. However, the number of women developers, committers, and Project Management Committee (PMC) members in this vast and...
Massive disruption is occurring as marketing goes digital. Business is moving steadily towards providing a fully personalized and truly integrated digital experience, building upon recent advances in user experience, analytics, cloud computing and storage, and an omni-channel experience across...
In Part I, we discussed the Senate Armed Services Committee's (SASC) attempt to hobble the open source Accumulo project in the DOD. They directed the Department's CIO to jump through a number of reporting hoops before Accumulo would be allowed inside the DOD, and directed the Accumulo team to...
The dozens of software projects launched in the wake of Google's Bigtable and MapReduce papers have changed the way we handle large datasets. Like many organizations, the NSA began experimenting with these "big data" tools and realized that the open source implementations available at the time...