Wednesday, May 14, 2014

Notable resources for data analysis

Thoughts and trends in Data Analysis
http://wesmckinney.com/blog/?p=77

Python for Data Analysis
http://pandas.pydata.org/

Mooc
https://www.udacity.com/courses#!/data-science

https://bigdatacourse.appspot.com/preview

https://www.udacity.com/courses#!/data-science

What language to learn in Data Analytics - 5% R, 80% Python

While R is still consider the "right" programming language for data analysis, Python is gaining ground.
I would suggest students spend a few hours on R to get a flavor of it but focus on using Python for actual analysis project for the following reasons:

We cover Python in several classes in our curriculum, there is enough local expertise, including staff, students and faculty, on this tool.  There are a lot of pre-built modules in Python, the use base is large.  Furthermore, because Python is a general purpose language, student can learn it and support their other classes or their future careers.

Survey of MOOC classes on Data Analytics

Courera:
https://www.coursera.org/course/datasci
Looks like a good fit.   But not sure when it will be offered.