Pandas Creator: Probably Never Any Python-Centric Big Data Solutions
At Data Day Texas 2015 on January 10, Wes McKinney, creator of Pandas and author of the book Python for Data Analysis, concluded his presentation with a slide that said:
The time for a "dark horse" Python-centric big data solution has probably passed us by. Maybe better to pursue alliances.
The reasons given include that Big Data is presently JVM-centric (Java for Hadoop, Scala for Spark) and that Python is not a JVM language. Of course, IPython Notebook will continue to be popular and powerful, and as I've written several times before, a sufficiently beefy workstation can handle over 100TB of data without resorting to cluster computing.
So unless something changes drastically, when it comes to Big Data, Python will just be a bolt-on technology.