Data Mining Tools: Perl, Matlab, SAS, Pig, Impala, Shark, Clojure, Scalding, Elasticsearch, Spark MLlib, Graphlab, Shogun and Weka
Mo Data stashed this in Big Data Technologies
data munging (“explore”, “clean” and “transform” above) – both exploratory data analysis (EDA) and operational ETL,
visualization – both exploratory and presentational,
We started with tools that I (Szilard) thought must be the most popular, but we also asked what other tools people are using, so we don’t miss any hidden gems.
"R" was top of every list. I have to wonder if it was the "correct" tool for the job or if it's just the "popular" tool of the moment?