I've blogged before about BIDMach (Single GPU-Powered Node 4x Faster Than 50-node Spark Cluster), a much newer project from AMPLab than Spark. But although BIDMach has plans for cluster operation, it apparently does not support it yet.
Wouldn't it be nice to be able to drag and drop Big Data components and sources together? Shouldn't we have a "Universal Streaming Connector" by now -- similar to USB™ but for data streaming?
Last week, my compatriot gushed over the New Chief Data Scientist of the United States Government, DJ Patil. He was not alone; the guys at the Partially Derivative podcast did as well.
Now, yes, we should be happy that government is not keeping itself in the stone age. But my two specific criticisms of all this gushing are:
Your new job title:
At the Data Science Association, we frequently receive inquiries from companies looking to hire data scientists. There aren't any. OK, even if you can find one: