DAL MOOC – Week 2 Data Wrangling

Tony Hirst/Sam Easterby-Smith  ©2007/cc-by-nc-sa 2.0
Tony Hirst/Sam Easterby-Smith  ©2007/cc-by-nc-sa 2.0

In this video, Open University academic, Tony Hirst talks about managing and analysing data, following the “4 Steps of Data Wrangling”: Clean, Shape, Augment, and Look. As with the other two videos I’ve reviewed, I’ve followed the spirit of the ‘revise/remix’ ethos of the course and have edited out the glitches (and enlarged the slides).

In summary, Tony provides a brief overview of following data wrangling tools:

He demonstrates Pivot tables and Sankey diagrams, and suggests looking for, outliers, similarities and differences, and trends, when exploring data for visualisation.

Tony also quotes John Tukey’s statement from half a century ago, that computers would allow people to become “journeymen carpenter’s of data analytics” and quotes Leland Wilkinsons’ to support the use of powerful tools to make sense of data and develop data narratives.


See more of Tony’s thinking at blog.ouseful.info and at github.com.

%d bloggers like this: