I’m a huge Hadoop fan. If you love innovation and startups, you HAVE to love Hadoop – simply because it’s fueling so much of both. Hadoop adoption has been particularly rapid among Netezza’s digital media customers (which include many startups). This makes perfect sense given the huge volumes of clickstream data they analyze in order to deliver innovative solutions for problems like ad targeting, ad yield optimization, attribution analysis, and website optimization. Because of this, I think I’ve gotten more questions about Hadoop over the past three years than just about anybody at Netezza.
So I was particularly excited when we kicked off Netezza’s partnership with Cloudera last year. Cloudera’s Distribution for Hadoop (CDH) has demonstrated incredibly rapid market share growth across multiple vertical industries over the past few years, and their team includes many of the world’s top experts on the technology. I’m especially grateful for the work that the Cloudera team has done in helping educate the market about the strengths of Hadoop relative to other technologies typically found in data management architectures. As a result, the most common Hadoop-related question I receive has shifted over the past few years from “Why can’t I replace my RDBMS with Hadoop?” to “How can I most benefit from using Hadoop alongside my RDBMS?”
This week, I’m thrilled that Cloudera has announced the immediate availability of the Cloudera Connector for IBM Netezza appliances, which is the first of its kind for CDH and Cloudera Enterprise and enables high-speed, bilateral data transfer between CDH and Netezza environments. This is a great development for digital media firms (and firms across every other industry) because it increases the value they can create when they leverage the interoperability and integration between these platforms for their own innovation. It also further validates the important work that Cloudera is doing to enable reliable enterprise-class deployments of Hadoop and speaks to the increasing demand among big businesses for the platform capabilities and services that Cloudera provides.
But most importantly, this week’s announcement underscores the market’s increasing recognition of a new generation of data management architectures that enable big data problems to be solved in ways previously not possible. My friend Ed Albanese does a great job of summarizing this next-generation architecture on the Cloudera blog here. And my friend and colleague Krishnan Parasuraman describes the complementary use cases we most frequently see among our digital media customers in the following slides:
The Cloudera Connector for IBM Netezza is free and can be dowloaded here – check it out!