Introduction to Apache Hadoop, an open source software framework for storage and large-scale processing of data sets on clusters of commodity hardware.
Milind Bhandarkar gives a talk about open source and big data at Great Wide Open 2014.
Apache Hadoop and related Apache Software Foundation projects have a rapidly growing feature set, high commit rates, and code contributions arriving from across the globe. However, the number of women developers, committers, and Project Management Committee (PMC) members in this vast and...
Massive disruption is occurring as marketing goes digital. Business is moving steadily towards providing a fully personalized and truly integrated digital experience, building upon recent advances in user experience, analytics, cloud computing and storage, and an omni-channel experience across...
In Part I, we discussed the Senate Armed Services Committee's (SASC) attempt to hobble the open source Accumulo project in the DOD. The committee directed the Department's CIO to jump through a number of reporting hoops before Accumulo would be allowed inside the DOD, and directed the Accumulo team to...
The dozens of software projects launched in the wake of Google's Bigtable and MapReduce papers have changed the way we handle large datasets. Like many organizations, the NSA began experimenting with these "big data" tools and realized that the open source implementations available at the time...
We are barely into the beginning of cloud computing, so any prediction of what its future will be is prone to error. Massive shifts in IT, such as the shift away from client/server into cloud architectures, are a function not only of winning technologies but also of users' behavioral patterns and...