Cloudera and Hortonworks Merge

This is exciting stuff, so let’s jump right in. First on the financials, which are sure to dominate the headlines. It’s a stock merger with Cloudera stockholders getting 60% stake in the new entity and Hortonworks stockholders obviously getting the other 40%. The combined organization is initially being valued at $5.2 billion, with Hortonworks having found positive cash flow by offering robust support services across the data...

Read More

“Big data, huh, what is it good for?…

The mood of this week’s Hadoop Summit has felt wonderfully diverse. There is a cognitive disconnect between the incremental progress of dot release feature sets and the revolutionary new business and societal applications of the technology. In the same keynote session the topics can swerve from optimizing cluster utilization to optimizing marketing yields to finding a cure for cancer. The technical lectures were packed, while the expo...

Read More

Psycho query: qu’est-ce que c’est?

Any Talking Heads fans reading this blog? Take any French classes in high school? No? Nevermind then. I get asked a lot about SQL on Hadoop, and I know what you’re thinking: “this guy must have the coolest friends and the go to all best parties.” And you’re right, I do. Lenny Kravitz by a rooftop pool in Vegas. Fitz and the Tantrums. Duran Duran. The Astoria Middle School Marching Band on Loyalty Day. (10-year...

Read More

Schrodinger’s Cat and Analytics Accessibility

Everyone loves the concept of Schrodinger’s cat, with the possible exception of a few serious PETA members. The metaphor that an entrapped feline can be both poisoned and/or not poisoned until directly observed is a catchy way to understand uncertainty around various possible states and outcomes. I see a similar problem with big data. Companies are going to great lengths to gather data and sophisticated workflows to analyze it...

Read More
EMC Dips Deeper Into The Shallow End of The Data Lake
Mar23

EMC Dips Deeper Into The Shallow End of The Data Lake

Barely a month after making its first big splash in the Data Lake, EMC is back at it with an all-in-one Big Data analytics solution — hardware, software and services – with availability and pricing to be determined later. The Federation Business Data Lake packages storage and Big Data analytics technologies from EMC Information Infrastructure, Pivotal, and VMware, together with services, to accelerate and automate deployment of...

Read More